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■ Abstract. Geometric semigroup theory is the systematic investigation 

of finitely-generated semigroups using the topology and geometry of 
their associated automata. In this article we show how a number of 
easily-defined expansions on finite semigroups and automata lead to sim- 
plifications of the graphs on which the corresponding finite semigroups 

04 . act. We show in particular that every finite semigroup can be finitely 

expanded so that the expansion acts on a labeled directed graph which 
resembles the right Cayley graph of a free Burnside semigroup in many 

■ respects. 
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1. Introduction 



Geometric semigroup theory is the systematic investigation of finitely- 
generated semigroups using the topology and geometry of their associated 
automata. An early example of this approach is the article [24] where the 
first author proved that the Burnside semigroups 



(for fixed m > 6 and n > 1) are finite ^-above, have a decidable word 
problem, and their maximal subgroups are cyclic. In addition, and perhaps 
most importantly, the Brzozowski conjecture — that the equivalence classes 
of elements form regular languages — was verified in this range. Independent 
proofs of these results were obtained by A. de Luca and S. Varricchio [10] at 
about the same time, and shortly thereafter additional cases were covered by 
A. do Lago [H] and V. Guba [16]. The techniques used in these other papers, 
however, were predominantly combinatorial in nature. In the present article 
we wish to generalize the geometric nature of the arguments used in |24j . In 
particular, we will prove the following: 

Theorem 1.1 (Rough statement). If S is a finite A-semigroup, then there 
is a finite expansion of S which acts faithfully on a labeled directed graph 
which has many of the nice geometric properties possessed by the right Cayley 
graph of a Burnside semigroup B{m,n), m > 6. 

In the course of the article, we will explicate the particular expansions 
involved and the precise nature of the resemblance. 

The proofs in [24] involved a detailed examination of the automata which 
recognize the equivalence classes of words in under the relations defining 
a Burnside semigroup. In particular, for each word w G A~^, the equivalence 
class [w] of words equal to w in B{m, n) was described as the language 
accepted by a non-deterministic finite-state automaton with a fractal-like 
structure. The deterministic version of this automaton is a full subautoma- 
ton of the right Cayley graph of B{m,n), and its geometric properties were 
described in some detail by the second author in |30J- The main theorem 
will follow immediately once we have shown that every finite automata can 
be finitely expanded, so that the expanded automata closely resembles these 
"McCammond automata" as described in [24J and [30]. 



We view here automata from two angles: as labeled directed graphs and 
as universal algebras. The former viewpoint is geometric, whereas the latter 
is algebraic. This section establishes the essential properties of the topology 



B{m,n) = {A\x'^ = X' 



.m+n 



2. The topology of directed graphs 
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of directed graphs that we shall need. Probably nothing in the first two 
subsections is original although maybe our slant is different. To some extent 
it follows [37] . 

2.1. Directed graphs. We begin with the definition of a directed graph. 

Definition 2.1 (Graph). A (directed) graph T consists of a set V{T) of 
vertices, E{T) of edges and two maps i, r : E{T) V{T) selecting the initial, 
respectively, terminal vertices of an edge e. Often we write e: v ^ w to 
indicate i{e) = v and r(e) = w. 

Directed and undirected paths in a graph are defined in the usual way. If 
p is an undirected path, then l{p),t(p) will denote the initial and terminal 
vertices of p, respectively and we shall write p: i{p) — )• t{p). We admit 
an empty path at each vertex. When we say "path" without any modifier, 
we mean a directed path, although we may include the word "directed" for 
emphasis. A directed or undirected path is called (vertex) simple if it visits 
no vertex twice; empty paths are considered simple. By a circuit we mean 
a non-empty closed path p (i.e., i{p) = t{p)). A circuit is called simple if 
the only repetition in the vertices it visits is when it returns to its origin. 
An undirected path is called reduced if it contains no backtracking (i.e., no 
subpath of length 2 using an edge first in one direction and then in the 
other). The inverse of an undirected path is defined in the usual way. 

A graph is connected if there is an undirected path from any vertex to any 
other. A connected graph is called a tree if it contains no reduced undirected 
circuits. By an induced or full subgraph we mean a subgraph obtained by 
considering some subset of vertices and all edges between them. 

There is a natural preorder on the vertices of any directed graph. 

Definition 2.2 (Accessibility order). Let F be a graph. Define a preorder 
on ViX) hy V w \i there is a path from w to v. A graph is termed acyclic 
if ^ is a partial order (equivalently, there are no directed circuits in T). As 
usual, an equivalence relation can be obtained form -< by setting v ^ w ii 
V < w and w ~< v. A strong component of a graph is an induced (or full) 
subgraph of T obtained by considering all vertices in a ~-equivalence class. 

A graph F is strongly connected if it has a unique strong component. In 
general, the set of strong components of F is partially ordered by putting 
C > C" if there is a path from a vertex in C to a vertex in C . 

It is convenient to divide strong components into two sorts: trivial and 
non-trivial. 

Definition 2.3 (Trivial strong components). A strong component can con- 
tain no edge. In this case, it consists of a single vertex and we shall call it 
trivial. Hence by a non-trivial strong component, we mean a strong compo- 
nent with at least one edge. In particular, a strong component with a single 
vertex and one or more loops edges is considered non-trivial. 
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Remark 2.4. Notice that if v ~< w, then there is a simple path from w to 
V. Indeed, let p: w ^ v he a minimum length path. If p is not simple, we 
may factor it p = uts with t a directed circuit (in particular t is non-empty) . 
Then us: w ^ v is shorter than p, a contradiction. So p must be simple. 

Other authors order vertices via the opposite convention. Our choice was 
made to be compatible with Green's relations in semigroups. That is, the 
accesibility order on the right Cayley graph of a semigroup (to be defined 
shortly) corresponds to the <^ ordering on the semigroup. 

2.1.1. Preordered sets. It is convenient here to introduce some terminology 
from the theory of preordered sets. If (P, <) is a preordered set, then a 
downset is a subset X Q P such that x (z X and y < x implies y (z X. 
Downsets are also called order ideals by some authors. If y C P, then 
denotes the downset generated by y. A downset of the form p-^ with p £ P 
is called principal. One can define upsets (called filters by some authors) 
dually. The upset generated by Y will be denote Y^. A subset X of a 
preordered set P is said to be convex ii x ~< y ~< z and x, z E X implies 
y £ X. The equivalence classes of P are defined by p ~ g if p < q and q < p. 
A preordered set P is a chain if any two elements of P are comparable. 
A principal series for P is an unrefinable chain 



of principal downsets. Every finite preordered set has a principal series. A 
principal series for a poset amounts to a topological ordering of the poset. 
In general, one can verify that Pj \ Pj+i is always an equivalence class of P 
and that every equivalence class must arise this way. 

Suppose now that V is the vertex set of a directed graph F and order V by 
the accessibility order. Then a downset in ^ is a subset X of vertices with 
the property that if the initial vertex of an edge belongs X, then so does the 
terminal vertex. The equivalence classes of the accessibility ordering are the 
strong components and a principal series amounts to the same thing as a 
topological ordering on the strong components. Therefore, when we assign 
indices to the strong components, we use notation consistent with (j2.ip . 

2.1.2. Transition edges. An important role in the theory is played by those 
edges that go between strongly connected components. One facet of geo- 
metric semigroup theory is to simplify the structure of these edges. 

Definition 2.5 (Transition edge). An edge e of a graph F is called a tran- 
sition edge if T(e) oo i(e) (or equivalently, there is no directed path from 
r(e) to i{e)). The frame Fr(F) of F is the graph with vertex set the strong 
components of F and edge set the transition edges of F (i.e., we contract 
each strong component to a point). If e is a transition edge, then e starts at 
the strong component of its initial vertex and ends at the strong component 
of its terminal component. Evidently, Fr(F) is an acyclic graph. 



P = Po D Pi D • • • D Pr 



n 
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The edge set E oi a directed graph F is also a preordered set. One can 
define e ^ / if either e = /, or there is a directed path in F of the form epf 
where p is some path. Notice that distinct edges e, / are equivalent if and 
only if they belong to the same strong component. Transition edges are pre- 
cisely those edges belonging to a singleton equivalence class. In particular, 
the transition edges form a poset. 

It is particularly important in geometric semigroup theory when the 
strong components form a chain. 

Definition 2.6 (Quasilinearity). If F is a directed graph we say that F is 
quasilinear if the natural partial order on its strong components is a total or- 
dering. Vertices in the top-most strong component of a quasilinear directed 
graph (if one exists) will be called top-most vertices. In a finite graph, when 
the strong components are linearly ordered in this fashion we will number 
them starting with and beginning with the top-most component to be 
consistent with (|2.ip . 

Of course, a quasilinear graph is connected and every strongly connected 
graph is quasilinear. The class of quasilinear graphs for which the transition 
edges form a chain plays a salient role in geometric semigroup theory. In 
this case Pr(F) looks like a line, whence the following terminology. 

Definition 2.7 (Linearity). A quasilinear graph F is said to be linear if its 
transition edges form a chain. 

Remark 2.8. If F is a finite linear graph with exactly k+1 strong components 
(numbered 0,1, ... ,k), then it has exactly k edges which do not belong to 
strong components, (which then necessarily connect the (i — 1)** strong 
component to the i*'^ strong component, i = I, . . . , k). 

Definition 2.9 (Entry and exit points). If F is a finite linear graph, the 
unique transition edge connecting the {i — 1)*** strong component to the i^^ 
will be called the i^^ transition edge. Its start point will be denoted qi-i and 
its end point will be denoted pi. Notice that the subscripts indicates the 
strong component which contains the vertex. Since the vertices pi and Qi are 
the places where transition edges enter and exit the z*^ strong component, 
we will sometimes refer to these vertices as the entry and exit points of the 
j^th component. Note that pi = qi is possible. 

li p = Pq \s sl specified vertex in the 0*^ strong component and q = qu 
is a specified vertex in the k^^ strong component, then any simple directed 
path in F from p to q will be called a quasi-base for F. Notice that every 
quasi-base for F will contain the transition edges plus simple paths in each 
strong component connecting pi to qi, and that conversely, any choice of 
simple paths connecting pi to qi, i = 0, . . . ,k can be strung together with 
the transition edges to form a quasi-base. In the case that there is a unique 
simple path from p to q (i.e., there is a unique simple path from pi to qi for 
each i) we shall call the corresponding quasi-base a base. 
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Example 2.10. An example of a finite linear graph has been schematically 
drawn in Figure 12. 1[ The shaded areas are meant to represent non-trivial 
strong components. In this example, there are 6 strong components and 
there are 5 transition edges, and their numbering has been illustrated. The 
first strong component is trivial (i.e., has no edges). The well-defined entry 
and exit points for each strong component have also highlighted. Notice 
that in this example and are identical, so that any base must use the 
trivial path to connect to q^. 




t 1 2 3 4 5 1 



Figure 2.1. A directed graph with a quasi-base. 

2.2. Morphisms of directed graphs. A morphism of graphs ip: F ^ F' 
consists of a pair {ipv,^E) of maps ipv ■ V{T) — )• V{T'), ipE- E{T) — )• E{T') 
so that (/5y(t(e)) = i{LpE{e)) and (/?y(r(e)) = r((^£;(e)) for all e G E{r). 
Normally we use (p to denote both maps. There is an obvious way to extend 
(/? from edges to paths. 

Two especially important classes of morphisms are directed coverings and 
directed immersions. If v € ^(r), then the star of v is St{v) = 

Definition 2.11 (Directed coverings and immersions). A graph morphism 
ip: F ^ F' is called a directed covering if it is surjective on vertices and, for 
each vertex v G ^(F), the induced map ip: St{v) St{ip{v)) is a bijection. 
If ip merely injective on stars, the (p is called a directed immersion. We do 
note require directed immersions to be surjective on vertices. 

Remark 2.12. Notice that if is a directed immersion, then ipip is a directed 
immersion if and only if -0 is a directed immersion. 

The following topological proposition is proved by straightforward induc- 
tion on the length of a path. 

Proposition 2.13 (Path lifting). Let ip: T ^ T' be a graph morphism. 
Then ip is a directed immersion if and only if, for each (directed) path p 
at v' € ^(F') and each v € ip~^{v'), there is at most one path q at v with 
'■Pio) = P- map ip is a directed covering if and only if it is surjective on 
vertices and, for each (directed) path p at v' £ V(r') and each v £ ip~^{v'), 
there is a unique path q at v with 'p>{q) = p. 

Notice that there is a bijection between sets and graphs with a single 
vertex. Hence we will frequently identify an alphabet A with the "bouquet" 
graph consisting of a single vertex with edge set A. A labeling of F by an 
alphabet A is then a graph morphism l:V ^ A. We can now define auto- 
mata using our topological language cf. [37]. See [EKlSlEij for background 
on automata theory. 
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Definition 2.14 (Automaton). A non-deterministic automaton over the 
alphabet A is a pair £/ = {T,£) where F is a graph and i: T ^ is a la- 
behng. A morphism of automata (p: {Ti,£i) {^2,^2) is a graph morphism 
ip: Ti ^ T2 so that 




commutes. 

An automaton (r,^) is called deterministic if £ is a directed covering; it 
is termed a partial deterministic automaton if ^ is a directed immersion. 
By an automaton or ^-automaton, we shall mean a partial deterministic 
automaton. In the context of automata, vertices are often called states and 
edges are termed transitions. If £/ = (T, i) is an A-automaton with vertex 
set Q, we sometimes abusively write ^ = {Q,A). 

By Remark 12.121 any morphism of ^-automata is a directed immersion. 
If =2/ = (r,^) is an A-automaton, then we can associate to it its transition 
monoid M{£/). Namely, we can define a "monodromy" action of the free 
monoid A* generated by A on V{T) via path lifting. If g G ^(r) is a 
vertex and w € A* then we can view w as a path p in This path has 
at most one lift p with l{p) = q hy Proposition 12.131 Define qw = t(j)) 
if p exists, and leave it undefined otherwise. One easily checks that this 
defines an action of A* on V{T) by partial functions; the associated faithful 
partial transformation monoid is denoted M{£/) and is called the transition 
monoid of A. Note that the action is by total functions if and only if £/ is 
deterministic. We denote by r/^ the transition morphism r],^/ : A* — )■ M{£/). 
Notice that p ^ q if and only if p G 5 • M{£/). Thus the accessibility order 
on V{T) corresponds to the inclusion ordering on cyclic M(i2/)-invariant 
subsets. The subsemigroup of M{.s/) generated by A is called the transition 
semigroup and is denoted S{£/). 

2.3. Semigroups and automata. An important example of a determin- 
istic automaton is the Cayley graph of an j4-semigroup. 

Definition 2.15 (^-semigroup). An A-semigroup is a pair {S,(p) where 
ip : A'^ — 7- S is a surjective homomorphism, where A~^ denotes the free semi- 
group on A. To avoid reference to ip, we put [wjs = ^{w) for w € A^. 

If 5 is a semigroup, then denotes S with an adjoined identity / (even 
if S was already a monoid). If S is an A-semigroup, it is convenient to 
consider the empty word as mapping to /. 

Definition 2.16 (Cayley graph). Let S be an 74-semigroup. The right 
Cayley graph Cay(S', A) of S with respect to A is the deterministic automaton 
with vertex set and edge set x A. Here i{s,a) = s and T{s,a) = sa. 
The edge (s,a) is usually drawn s sa. The advantage of this "monoid" 
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Cayley graph is that the non-trivial paths from / correspond exactly to the 
words in A'^ and two such paths have the same endpoints if and only if the 
words corresponding to these paths represent the same element in S. 

The strong components of these graphs are also of interest. 

Definition 2.17 (Schiitzenberger graphs). If S is an ^-semigroup then 
the strong components of Cay(5, A) are the Schiitzenberger graphs of the 
■^-classes of . In other words, the vertex set of a strong component 
contains exactly those vertices which represent the elements in an ^-class, 
and the strong component itself is the full subgraph of Cay(5, A) on this 
vertex set. For each word w G A'^, we will denote the strong component of 
Cay{S,A) containing the vertex labeled [w]s by Sch'^(i(;) (where Sch stands 
for Schiitzenberger). Suppose [w]s = s. Since Sch'^(i/;) only depends on the 
element s and not on the word w, we sometimes write Sch'^(s) instead. The 
study of the way in which S acts on its Schiitzenberger graphs has been 
dubbed the semilocal theory. See Chapter [20, Chapter 8] and [32, Chapter 
4]. 

Directed coverings correspond to transformation semigroup homomor- 
phisms. In this paper, a partial transformation semigroup is a pair (X, S) 
where S is a semigroup acting faithfully on the right of X by partial trans- 
formations. Recall that if (X, S) and (y, T) are partial transformation semi- 
groups, then a morphism is a pair ((^, "0) where X — > y is a function and 
■0: 5 — > T is a homomorphism such that (p{xs) = (p{x)tp{s) for all x € X 
and s G 5, where equality means that either both sides are undefined or 
both are defined and equal. If 5 and T are A-generated, we shall call ((^, ijj) 
a morphism of A-partial transformation semigroups if t/^ is a homomorphism 
of ^-semigroups. 

The following lemma is standard |13j . 

Lemma 2.18. Let (X, 5) and iY^T) he partial transformation semigroups 
and if. X ^ Y a surjective function so that, for all s € 5, there exists s € T 
such that ip{xs) = (p[x)'s for all x ^ X and s (z S (interpreting equality as 
above). Then there is a unique homomorphism ip: S —?■ T so that {ip^ip) is 
a morphism. 

Proof. First we show that s is unique. Suppose t,t' G T satisfy ip{xs) = 
ip{x)t = ip{x)t' for all X G X, s G S. Let y £ Y with yt defined and choose 
X G (/3~^(y). Then ip{x)t is defined and hence ip{xs) is defined and so ip{x)t' 
is defined. Moreover, yt = ip{xs) = yt' . Similarly, yt' defined implies yt 
is defined and yt' = yt. Thus by faithfulness t = t'. Hence the element 
's in the hypothesis is unique. Define il^: S ^ T hy ip{s) = s. Notice for 
s, s' G S* that xss' is defined if and only if ip{xs)ilj{s') is defined, if and only if 
ip{x)ip{s)'ip{s') is defined and that ip{xss') = (p{x)ilj{s)il){s'). The uniqueness 
then implies 'tp{ss') = '^^J{s)^l^{s'). Finally the uniqueness of ip follows from 
the uniqueness of s. □ 
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We now verify that, for partial deterministic automata, a morphism is 
a directed covering if and only if it is surjective on vertices and induces a 
morphism of partial transformation semigroups. 

Proposition 2.19. Let £/ and £/' be partial deterministic A-automata. 
Then the following are equivalent: 

(1) A directed covering of A-automata ip: ^ .s^' ; 

(2) A surjective morphism {ip,ip): {V{s^),S{s^)) {¥{£/'), S {£/')) 
where tp is a morphism of A-semigroups. 

Proof. Suppose first that is a directed covering. We claim that for all 
vertices q oi £^ and w € one has ^p{qw) = ip{q)w (with the usual 
meaning). Indeed, the image of a path labeled w must be labeled w. So qw 
defined means ip{q)w is defined and ip{qw) = ip{q)w. Conversely, if ip{q)w is 
defined, then Proposition 12.131 implies that there is a lift of w starting at q, 
which must also have label w. So qw is defined and ip{qw) = ip{q)w. Hence 
we can define = ^^(i^') in Lemma [2. 181 to obtain -0- 

Conversely, suppose {(p, ip) is well defined. Define ip: si ^ s^' to agree 
with 9? on vertices. If e: p — ?• (7 is an edge of si with label a, then pa = q 
and so ip{q) = (p{pa) = ip{p)a. Thus there is a (unique by determinism) edge 
labeled by a from ip{p) to ip{q), which we define to be ip{e). Clearly (/9 is a 
morphism of ^-automata. Let us check that it is a directed covering. Let q 
be a vertex of £/ and let e G L~^{ip{q)) be an edge labeled by a. Then ^p{q)a 
is defined, so qa must be defined and ^p{qa) = ^p{q)a. Since the automaton 
is deterministic, there is a unique edge labeled by a emanating from q and 
it must map to e under ip (again by determinism). □ 

2.4. Rooted graphs. In this article, we shall mostly be interested in rooted 
graphs. 

Definition 2.20 (Rooted graph). A rooted graph is a pair (r,u) where T 
is a graph and w is a vertex of F such that every vertex of T can be reached 
from u by a directed path (i.e., the strong component of v is the unique 
maximum component in the ordering on strong components). 

There is an analogous definition for automata. 

Definition 2.21 (Pointed automaton). By a pointed (or initial) automaton, 
we mean an ^-automaton = (T, (.) with a distinguished vertex I so that 
(r, /) is a rooted graph, that is, / • M{s/) = V{r). We denote the pointed 
automaton by More generally, we say that a subset I is a generating 

or initial set for if I ■ M(i/) = V{T), that is, the downset generated by I 
is V(T). In this case, we indicate the generating set by writing (i/, I). 

Often it is important to endow an automaton with initial and terminal 
states in order to accept a language. 

Definition 2.22 (Acceptor). A non-deterministic acceptor over an alphabet 
A is a non-deterministic ^-automaton s/ equipped with a distinguished 
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initial state I and a set T of terminal states. The language of the acceptor 
consists of all words in labeling a path from / to an element of T. If 
we use the word acceptor unmodified, then the underlying automaton is 
assumed partial deterministic. By a deterministic acceptor, we mean one in 
which the underlying automaton is deterministic. The languages accepted 
by finite acceptors are the so-called regular or rational languages. 

An acceptor (i?/, I, T) is said to be trim if every vertex is contained in a 
directed path from the initial state to some terminal state. In particular, 
trim acceptors are rooted at /. 

For example, the Schiitzenberger graph Sch'^(s) can be turned into an 
acceptor by specifying the vertex labeled s as both the initial state and its 
only terminal state. Whenever we refer to Sch'^(s) as an acceptor without 
specifying initial and terminal states, this is what we intend. 

Definition 2.23 (Reading words). Let be an ^-automaton. Examples 
include the Cayley graph Cay(5, ^) and Sch'^(s). A word w G is said to 
be readable on si if there exists a directed path in si where the concatena- 
tion of labels is the word w. Similarly, if is a vertex si then being readable 
starting at v or readable ending at v has the obvious meaning. 

Using this language, we can restate the advantage of the Cayley graph 
Cay(S', A) as follows. Every word w G A'^ is readable starting at I, and 
because Cay{S, A) is deterministic, w is readable starting at / in exactly one 
way. 

An important fact is that the Cayley graph of the transition semigroup 
of a complete deterministic pointed automaton is always a directed cover of 
the automaton. 

Proposition 2.24. Let si = (F, v) be a 'pointed deterministic A-automaton. 

Then there is a directed covering p: Cay{S{j^),A) — > s/ of A- automata 
given by s vs on vertices. 

Proof. The map p is clearly is surjective and satisfies p{ts) = vts = p{t)s. 
Proposition [2T9] yields the desired result. □ 

A rooted graph (F, v) is called a directed rooted tree if F is a tree. Directed 
rooted trees are characterized by having a unique directed path from the root 
to any vertex. 

Proposition 2.25. A rooted graph (T,v) is a directed rooted tree if and only 
if, for each vertex w G V(T), there is a unique directed path from v to w. 
This path is necessarily simple. 

Proof. Suppose first that F is a rooted directed tree and that there are two 
directed paths p,q: v ^ w. Then by considering the longest common initial 
and terminal segments of p, q we can write p = urs and q = uts so that 
r, t begin and end with different edges. Then rt~^ is a reduced undirected 
circuit in F and so F is not a tree. Suppose conversely, that F is not a tree. 
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Figure 2.2. 

Consider a reduced undirected circuit p in F. Let w = i{p) and let g: f — ?> w; 
be a directed path. Replacing p by a cyclic conjugate of its reverse circuit 
if necessary, we may assume the first edge of p is traversed in the positive 
direction. If p is a directed circuit, then g, qp are two directed paths from v 
to w and we are done. Otherwise, we may factor p = ue~^s where u is the 
longest directed initial segment of p and indicates that e is traversed 
backwards. Let e: a b. Then b = t(u). Let r: t; ^ o be a directed path. 
See Figure 12. 2[ Then re and qu are two directed paths from v to b. We 
claim they are distinct. Indeed, if the last edge of u were e, then p = ue~^s 
would not be reduced. This completes the proof. 

If (F,f) is a directed rooted tree and p: v ^ w is the unique directed 
path, then obviously p is simple. For if we could write p = qur where u 
is a non-empty path with i{u) = t(m), then u is a reduced circuit in F, 
contradicting that F is a tree. □ 

The unique directed path from v to w \s denoted [v, w\ and called the 
geodesic from v lo w. More generally, if T is a rooted directed tree and 
w < u, then there is a unique (directed) simple path from u to w, which we 
denote [u, w] and call the geodesic from u to w. 

Let (F, v) be a rooted graph. By a directed spanning tree T for (F, v) we 
mean a directed rooted subtree (T, v) containing all the vertices of F. The 
next proposition shows that every rooted graph admits a directed spanning 
tree. One should think of a directed spanning tree as a collection of normal 
forms for each vertex of F. 

Proposition 2.26. Let {T,v) be a rooted graph. Then (T,v) admits a di- 
rected spanning tree. 

Proof. Let ^ be the collection of all directed rooted subtrees (T, v) of (F, v) 
ordered by inclusion. It is non-empty since it contains {{v}, v). Next observe 
that if {{Ta,v) \ a £ A} is a chain in ^, then (UoeA^ct''^) ^ ^ since any 
reduced undirected circuit in this union must belong to some T^. Thus by 
Zorn's Lemma, ^ contains a maximal element {T,v). We claim that this is 
the desired directed spanning tree. For suppose V{T) C V{T). We claim 
that there is an edge e € E{r) with i(e) e ^(r) and T(e) ^ V(T). For if 
this is not the case, then V(T) is a downset in the preorder ~< on 1^(F). But 
V G V{T) and F is rooted at v. Thus V{T) = V{T). So let e be such an 
edge. One easily verifies that the graph T' obtained by adjoining e to T, 
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SO V{T') = V{T) U {T(e)} and E{T') = E{T) U {e}, is a tree. Indeed, if 
w € V{T)^ then the unique directed path in T' from v to w \s the geodesic 
in T, whereas the unique directed path from v to r(e) is pe where p is the 
geodesic [v, i(e)] in T. Thus T' is a directed rooted tree by Proposition 12.251 
Clearly (T', v) is a larger element of than T. This contradiction completes 
the proof. □ 

Remark 2.27. The proof of Proposition 12.261 can easily be adapted to prove 
that if {Tq^v) is any directed rooted subtree of a rooted graph (r,f), then 
there is a directed spanning tree {T,v) containing (To,u). 

It turns out that every rooted directed graph (P, v) has a unique directed 
cover that is a tree. Moreover, this tree is a directed cover of all directed 
covers of iT,v) and hence is called the universal directed cover of iT,v). 

Theorem 2.28. Let (P,^^) he a rooted graph. Then there is a rooted tree 
(P,?^) and a directed covering vr: (P,?^) — t- (P,i^). Moreover, given a directed 
covering ip: (P',v') — )• (P,f), there is a unique morphism (necessarily a 
directed covering) tp: {T,v) — )■ (T',v') so that 



commutes. 

Proof. The proof is so similar to the classical undirected case that we just 
give the construction of P and leave the remaining details to the reader. 
The vertices of P are the directed path starting at v. One takes v to be the 
empty path at v. The edges of P consist of pairs (p, e) where p is a path 
from V and t{p) = i(e). The incidence functions are given by i{p, e) = p and 
T{p,e) = pe. The directed covering vr is given by 7r{p) = t{p) on vertices 
and 7r(p, e) = e on edges. □ 

For example, they Cayley graph of A* is the universal directed cover of 
the bouquet as well as of any other deterministic ^-automaton. 

2.4.1. The fundamental monoid of a rooted graph. If (P, v) is a rooted graph 
and V € V^(P), then one can define the fundamental monoid P*(f) of (Pju) 
to be the monoid of all loops at v with the concatenation product. For 
instance, the fundamental monoid of the bouquet SSa at its unique vertex 
is the free monoid A*. Notice that the fundamental monoid ignores all of 
P except the strong component of v and moreover, it depends on the root 
vertex even for strongly connected graphs. Thus one should really work 
with the free category P* on the graph P [23j. Analogously to the case of 
fundamental groups, the fundamental monoid of a rooted graph is free. 



(p' 
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Figure 2.3. An infinitely generated fundamental monoid 

Proposition 2.29. Let {T,v) be a rooted graph. Then T*{v) is free on the 
set of non-empty paths p: v ^ v that do not visit v except for at the 
beginning and the end. 

Proof. Indeed, q: v v is any non-empty loop, then it has a unique 
factorization q = pi - • • Pn with pi , . . . , p„ G P by partitioning q according 
to each time it visits v. Thus r*(t') is free on P. □ 

The next example shows that the generating set P^ need not be finite 
even when the graph T is finite and also exhibits the dependence on the 
generating set. 

Example 2.30. Consider the strongly connected graph T in Figure [T3l Then 
T*{v) is freely generated by the infinite set {eg'^f \ n > 0}, whereas (T,w)* 
is freely generated by fe,g. 

The next result generalizes the situation for free monoids that can be 
found, for instance, in [3]- A submonoid of a monoid M is called right 
unitary if u,uv (z N implies v & N for u,v M. 

Proposition 2.31. Let (p: (T,v) — )• {T,v) he a directed covering of rooted 
graphs. Then the induced map (p: T*{y) — > T*[v) is injective and the image 
is right unitary. Conversely, every right unitary submonoid ofT*{v) is of 
this form. 

Proof. Since is a directed covering, any loop at v has at most one lift 
starting at v and so ip is an injective homomorphism. Suppose that p,pq G 
(p{T*{v)) with q G r*(i;). Choose lifts p,q of p and q respectively starting 
at V. Then p must be a loop at v by uniqueness of lifts. Also, since pq lifts 
pq, it too must be a loop at v. It follows now that g is a loop at v and so 
Q = V'io) € as required. 

Now suppose that A is a right unitary submonoid of T*{v). Let X be the 
set of directed paths in P starting at v. Define an equivalence relation on X 
by p = g if r(j?) = r(g) and pn G A if and only if qu & N for all u : t{p) v. 
Let V be the quotient of A by = and let E consist of all pairs ([p],e) with 
e G E(T) and T{p) = i{e). We define F to have vertex set V and edge set 
E where ([p],e) goes from [p] to [pe\. Notice that [p] = [q\ implies pen G A 
if and only if qeu G A^ and so the incidence functions are well defined. The 
reader can easily verify that the maps [p\ ^ t{p) and ([p],e) i-^ e yield a 
directed covering. Surjectivity on vertices requires that (F, v) is rooted. Let 
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V = [It,]. One can verify directly that if g is a loop at v, then the lift of 
q to V ends at [q]. Now [q] = v, if and only if g = It,. Let us show that 
this is equivalent to q G N. If g = 1„, then since 1„ € A^, it follows that 
q = qly G N. Conversely, if g G A^, then the very definition of right unitary 
implies qu G N if and only if n G A^, if and only if l^u € N. Thus q = ly, 
completing the proof that T*{v) maps onto N. □ 

There can be multiple directed coverings corresponding to a given right 
unitary submonoid ofT*{v), but the construction given in Proposition l2.311 is 
the unique minimal one in the sense that all others cover it. The above proof 
also shows that directed immersions induce injective maps on fundamental 
monoids. _ 

Notice that if ip: {T,v) — >■ {T,v) is a directed covering of rooted graphs, 
then r*{v) has a monodromy action on (p~^{v) given on w G ip~^{v) by 
wp = w' where w' is the end point of the unique lift of p starting at w. This 
generalizes the action of the transition monoid of a deterministic automaton. 

2.5. The unique simple path property. Perhaps the most important 
notion in geometric theory is that of a rooted graph with the unique simple 
path property. We give several equivalent properties that will define this 
notion. Recall that path unmodified means directed path. 

Proposition 2.32. Let {T,v) be a rooted graph. Then the following are 
equivalent: 

(1) For each vertex w, there is a unique simple path from v to w; 

(2) (r, v) admits a unique directed spanning tree; 

(3) (r, v) admits a directed spanning tree (T, v) such that, for each edge 
e G E{r) \ E{T) one has [v, r(e)] is an initial segment of [v, i{e)] 
(i.e., r(e) is visited by the geodesic [v,L{e)]). 

Proof. To see that (1) implies (2), suppose that (T, v) and (T', v) are distinct 
spanning trees. Then there is an edge e that belongs to, say, T and not T' . 
Set w = r(e). Let p: u — > i(e) and q: v ^ w he the geodesies in T and T' 
respectively. Then pe: v w is a directed path in T and hence simple by 
Proposition 12.251 Since e ^ T', clearly q ^ pe. This contradicts (1). 

For (2) implies (3), let T be the unique directed spanning tree for T and 
suppose that e G E{r) \ E{T). Suppose that r(e) ^ [v,z.(e)]. The path 
[v, t(e)]e is then simple and so its support T' is a directed tree rooted from 
V. By Remark 12.271 we may complete T' to a directed spanning tree that 
evidently is different from T. This contradicts the uniqueness of T. 

Finally, we prove (3) implies (1). Let T be the spanning tree provided by 
(3). Let p = [v, w] be the geodesic in T. Then p is a simple path from v to 
w. Suppose that q: v ^ w is another simple path; it cannot be contained 
in T so let e be the first edge used by q that does not belong to T. Then 
we can factor q = ret where e G E(T) \ T and r = [u,t(e)]. But then by 
assumption r(e) G [i',t(e)]. This contradicts that q is simple. We conclude 
that p is the unique simple path from v to w. □ 
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Figure 2.4. A graph with tEeunique simple path property 
at V but not w. 

The third condition says that F is a "rooted tree falling back on itself." 
Here is the key definition in geometric semigroup theory. 

Definition 2.33 (Unique simple path property). A rooted graph {T,v) 
is said to have the unique simple path property if it satisfies the equivalent 
conditions of Proposition l2.321 If (F, v) has the unique simple path property, 
then we denote the unique simple path from v to if by [v, w] and call it the 
geodesic, just as we did for directed rooted trees. 

Remark 2.34. Having the unique simple path property depends not just on 
the graph F, but also on the choice of the root. For example the graph in 
Figure \2A\ has the unique simple path property from v but not from w. 

Remark 2.35 (Non-planar). Also, having the unique simple path property 
does not force the graph to be planar. For example, the complete bipartite 
graph is non-planar but as shown in Figure 12.51 it can be oriented to 
have the unique simple path property from one of its vertices {pi in this 
case) . 



Figure 2.5. A non-planar graph with the unique simple 
path property. 

Remark 2.36. Notice that if (F, v) has the unique simple path property, then 
each transition edge of F belongs to the directed spanning tree by the third 
item of Proposition 12.321 

It is easy to see that if (F, v) has the unique simple path property and X 
is a convex set of vertices of F containing v, then {A{X),v) has the unique 
simple path property where A{X) is the subgraph induced by X. 

The frame of a rooted graph with the unique simple path property is 
a directed rooted tree. Moreover, each strong component has a unique 
transition edge entering it, called its entry edge. The endpoint of the entry 
edge is termed the entrance of the strong component. Most importantly, if 
C is a strong component with entrance w, then (C, w) has the unique simple 
path property as does {C'^, w) where is the subgraph of F induced by the 
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downset in ~< generated by the vertices of C (or equivalently by w). This is 
summarized in the fohowing proposition. 

Proposition 2.37. Let {T,v) have the unique simple path property. Then 
(Fi{T),Cv) is a directed rooted tree where Cw denotes the strong component 
of a vertex w. Each strong component has a unique transition edge entering 
it, called its entry edge. If C is a strong component with entry edge e and 
entrance w = T{e), then {C'^,w) has the unique simple path property and 
hence {C,w) has the unique simple path property (being convex in {C'^,w)). 

Proof. We show that there is a unique directed path from Cy to C in Fr(r) 
for any strong component C. Proposition 12.25] then imphes that (Fr(r), Cy) 
is a directed rooted tree. It is immediate from the definition of Fr(r) that 
there is a path Cx — >• Cy if and only if there is a path from x to y. Thus there 
is some path from Cy to C . Suppose that p = ei ■ ■ ■ Cn and g = /i • • • /m are 
two distinct such paths; they are necessarily simple since Fr(r) is acyclic. 
The Cj and fj are transition edges of F. Since e„ and fm end in C, we can 
find a simple path in r: r(e„) — t- T{fm) in C. Also, for each 1 < i < n — 1 
we can find a simple path pi contained in the strong component of T(ej) 
from T(ej) to /-(cj+i). Let t: f — > i{ei) be a simple path contained in Cy. 
Then p' = teip2 ■ ■ ■ e„_ip„_ie„,r is a simple path from v to T{fm) whose 
transition edges are precisely ei, . . . , e„. Similarly, we can construct a simple 
path q' = t'fiqi ■ ■ ■ fm-iQm-ifm from v to r(/m) whose transition edges are 
precisely fi, . . . , fm- Since p' 7^ q', this contradicts the unique simple path 
property. We conclude (Fr(r),C^) is a directed rooted tree. 

Suppose that e, e' are distinct transition edges entering C. Say e: Ci — > C 
and e': C2 — 7- C in Fr(F). Let p,q be the geodesies in Fr(r) from Cy to 
Ci,C2 respectively. Then pe,qe': Cy ^ C are distinct directed paths, a 
contradiction to Fr(F) being a rooted tree. 

Finally, let C be a strong component with entry edge e and entrance 
w = T(e). Let z be any vertex with z < w. Suppose p, g: w ^ z are simple 
paths. Since e is a transition edge, p and q cannot use the edge e. It follows 
that [u,i(e)]ep and [f,t(e)]eg are simple paths from v to z. We conclude 
p = q, as required. □ 

The following corollary is immediate. 

Corollary 2.38. // (F,f) has the unique simple path property and is quasi- 
linear, then F is linear. 

Let us fix for the moment a rooted graph (F, I) with the unique simple 
path property and denote by T its unique directed spanning tree. Let us 
use <T to denote the accessibility order in T. Notice that u <t v implies 
u < V, but not conversely. For example, < is trivial on strongly connected 
components of F, but <t is still non-trivial. If is a vertex of F, denote 
by the set of vertices u G ^(F) with u <t v. Abusively, we shall also 
denote the induced subgraph with vertex set by the same notation. Let 
us denote by CyH. the strong component of v in v^. 
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Proposition 2.39. Let (F, /) be a graph with the unique simple path prop- 
erty and suppose v € V{T). The rooted graph {v^,v) has the unique simple 
path property. Consequently, (C^Ji , v) has the unique simple path property. 

Proof. If u £ v^, then there is a simple path in T from u to u by definition, 
which we denote [v, u]. Then [/, u] = [I, v] [v, u]. Suppose that p: v — ?> n is a 
simple path in and consider [I, v]p. If this path is simple, [v, u] = p and 
we are done. If not, then there is a vertex r visited twice by [/, v]p. Since 
[/, v] and p are simple, it follows that r ^ v and is visited by both [/, v] and 
p. But this is impossible since this means that v <t r and r <t v. □ 

Sometimes one wants to extract a quasilinear subgraph from a graph 
with the unique simple path property. If u; is a vertex of F, then by abuse 
of notation, w"^ will also denote the full subgraph of F with vertex set = 
{q G ^(r) \qyw}. 

Proposition 2.40. Suppose that (F, v) has the unique simple path property 
and w G ^(F). Then {w^ ,v) has the unique simple path property and is 
quasilinear. In particular, is linear. 

Proof. Since w'^ is convex, it has the unique simple path property from 
V. To see that it is quasilinear, let C, C" be strong components of w'^ . 
Then since Fr(F) is a tree and C, C" are between the strong component of 
V and the strong component oi w, it follows that C and C are comparable 
in the accessibility order. Thus is quasilinear and hence is linear by 
Corollary [2381 □ 

We now define the "bold" arrows of a rooted graph with the unique simple 
path property. 

Definition 2.41 (Bold arrows). Let (F,/) have the unique simple path 
property with directed spanning tree T. Then E(F) = -E(F) \ E{T) is called 
the set of hold arrows of F. (Notice the bold font is used to denote the set 
of bold arrows.) 

The set E(F) of bold arrows is in bijection with a generating set for 
7ri(F, /) (or equivalent i?i(F)) and so the number of bold arrows is just the 
first Betti number of F. Notice that if F' is any subgraph of F containing 
the spanning tree T, then (F',/) still has the unique simple path property. 
Thus removing bold arrows does not cause one to lose the unique simple 
path property. Induction on the number of bold arrows is a key idea in 
geometric semigroup theory. 

An important notion in geometric semigroup theory is that of a sloop. 
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Definition 2.42 (Sloop). If e € E(r) is a bold arrows, then the sloop (think 
"simple loop") is the path slp(e) = [I,L{e)]e. In pictures, we have 

4e)^ N 

/ > Tie) ^ 

[/,T(e)] ^ ^ [r(e),.(e)] 

where the bold arrow is drawn doubled. 

If e is a bold arrow, then the corresponding generator of 7ri(r,/) is 
slp(e)[/, T(e)]~-^. Next we define the loop of a sloop. 

Definition 2.43 (Loop of a sloop). If e G E(r) is a bold arrow, define the 
loop lp(e) of the sloop slp(e) to be the simple circuit [r(e), i(e)]e. So the 
loop lp(e) of the sloop slp(e) in the picture from Definition 12.421 is 



■ [r(e),.(e)] 

Note that slp(e) = [/, r(e)]lp(e). 

In a graph with the unique simple path property, each non-trivial strong 
component is a union of loops of sloops. 

Proposition 2.44. Let (F, v) have the unique simple path property and let 
C be a strong component of T containing at least one edge. Then C = 

UeeE{r)nclp(e)- 

Proof. Obviously, lp(e) C C for any e S E(r) n C since lp(e) is strongly 
connected. For the converse, it suffices to show that each edge of C belongs 
to lp(e) for some e € E(r) since every vertex of C must be on some edge. 
Let u be the entrance of C and suppose / G E{C). If / G E(r), there is 
nothing to prove. So suppose that / G E{T); say f:v^w. Choose a 
simple path p: w ^ u. Then q = [u, v]fp is not simple and so we can factor 
q = [u, L(e)]eq' where e is the first bold arrow used by q. Evidentally, [u, v]f 
is an initial segment of [u, i(e)] and so i(e) <t w. On the other hand, since 
p is simple, we must have v <t T(e) and so t(e) <t w <t v <t T(e). See 
Figure \2M Thus / is an edge of lp(e) = [r(e), t(e)]e. □ 

Next we wish to define the notion of a geometric rank function. Later on, 
when we deal with automata, we shall impose some algebraic restrictions on 
our rank functions, but here we define things in complete generality. From 
now on we assume that all graphs have finite out-degree. 

Definition 2.45 (Geometric rank function). Let (F, /) have the unique sim- 
ple path property. Then a geometric rank function is a mapping r: E(F) — >■ 
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r(e) 
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Figure 2.6. 



N so that, for each vertex v, r maps r ^(u) nE(r) bijectively to the interval 
[0, \t-'^{v) n E(r)| - 1]. We then totally order t'^{v) H E(r) by putting 
e <r- / if and only if r(e) < r{f). 

Said differently, a geometric rank function is a way of totally ordering the 
bold arrows at each vertex. Given a geometric rank function, we can now 
define a partial order on the bold arrows as follows. Recall that C„ denotes 
the strong component of a vertex v. 

Definition 2.46 (Order on bold arrows). Fix a geometric rank function r. 
Then define a partial order on E(r) by putting e / for e 7^ / G E(r) if: 



The bold arrows are 61,62,63,64. One has ei <^ <^ 63, whereas 62 is 
incomparable to the other arrows. 

2.6. Cutting sloops. A major idea in geometric semigroup is to work by 
induction on the number of bold arrows. To do this, we need to cut sloops. 
First we talk about cutting arbitrary rooted graphs. 

Definition 2.48 (Cutting a graph). Let {A,v) be a rooted graph. Define 
cut(A, v) to be the graph with vertex set V^(A) U {{v} x (f )) and edge set 
E{A). The initial vertex function l is as before. The new terminal vertex 



(1) Cr{f) < Cr(ey, or 

(2) Cr(e) = Cr(f) and T(e) <t rif); or 

(3) r(6) =t(/) and r(e) < r(/). 



Example 2.47. Consider the graph 
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function t' is given by 

\{v,e) T{e) = v. 

Geometrically, cut(A,t;) is obtained from A by taking each edge ending 
at V and moving its end away from v. Notice the map cut(A, — >■ A which 
is the inclusion on ^(A) U -E'(A) and which sends {?;} x T~^{y) to w is a 
directed graph immersion. 

Example 2.49. Suppose (A, v) is the rooted graph in Figure \2. 7\ Then 




Figure 2.7. The graph {A,v) be 
cut(A,t>) is the graph in Figure [278[ 




Figure 2.8. The graph cut(A,t;) 

The next proposition establishes some basic properties of cut graphs. 

Proposition 2.50. The graph cut(A,t)) is rooted at v. Moreover, the rooted 
graph (cut(A, w), has the unique simple path property if and only if{A,v) 
does. If {A,v) has the unique simple path property, then the bold arrows of 
(cut(A, t;), are the bold arrows of A that do not end at v. 

Proof. We shall repeatedly use the following fundamental observation: if 
w € 1^(A) with w ^ V, then the simple paths between v and t;; in A and 
cut(A, v) are one in the same. This is because on the one hand, a simple 
path from v never uses an edge of t~^{v); on the other hand, a path in 
cut(A,z;) using an edge e € t~^{v) can only use that edge as its last edge 
and therefore does not end in V{A). 
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It follows that ii w € V(A), then there is a simple path from f to it; in 
cut(A,'u). On the other hand, if e G t~^{v) and p is a simple path from v 
to i(e) in A, then the fundamental observation implies pe is a simple path 
in cut(A,t;) from v to {v, e). 

Prom the fundamental observation, it is immediate that if (cut(A, t;), t;) 
has the unique simple path property, then so does (A,t;). Suppose now 
that {A,v) has the unique simple path property. If w ^(A), then the 
fundamental observation shows that there is a unique simple path form v to 
w in cut(A,'u). Now any simple path from v to {v,e) for e G t~^{v) must 
end in the edge e since it is the only edge ending at {v,e). Thus the unique 
simple path in cut(A, v) from v to {v, e) is pe where p is the unique simple 
path in A from v to i(e). It follows from this discussion that the bold arrows 
of (cut(A,'u), are those bold arrows of A that do not end at v. □ 

Next we need to define a notion of a closed subgraph with respect to a 
base point. 

Definition 2.51 (Closed subgraph with respect to a point). Let (F, /) have 
the unique simple path property and let T be its unique directed spanning 
tree. Fix u G V{T). A subgraph A of F is said to be closed with respect to 
u, or the pair (A,^) is said to be closed if: 

(1) u£ AQ u^; 

(2) For all v G V{A), one has [u,v] C A; 

(3) If e G E(F) and u / T(e) G A, then e G A. 

Let us show that the set of all closed subgraphs with respect to u is a 
complete lattice. Throughout (F,7) is a fixed graph with the unique simple 
path property. 

Proposition 2.52. The collection L„ of all closed subgraphs ofT with re- 
spect to u has maximum element and is closed under non-empty inter- 
sections. Thus it is a complete lattice. The bottom of is {u} and its join 
is determined by the meet. 

Proof. First note that the graph is closed with respect to u. The first two 
axioms are obvious. The last one follows since in a graph with the unique 
simple path property i(e) <t T(e) for any bold arrow e. Clearly it is the 
largest element of L„ by (1). It is trivial that the set of elements satisfying 
(l)-(3) is closed under non-empty intersections. □ 

As a consequence we can define the closure of a subgraph. 

Definition 2.53 (Closure of a pointed subgraph). If A C u"^, define (A, u) = 
(A,^) where A is the meet of all elements of containing A. This defini- 
tion makes sense since the top of L„ is u^. 

Let us establish some basic properties of (A,n). 

Proposition 2.54. Let (A, u) be closed. Then (A, u) has the unique simple 
path property. 
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Proof. Proposition 12.39] shows that {u^,u) has the unique simple path prop- 
erty. Since ti € A C u^, it suffices to show that {A,u) is rooted at u. But 
this is immediate from Definition 12.51( 2). □ 

Consequently, (A, n) has the unique simple path property for any A C u^. 

Proposition 2.55. Suppose that (A,n) is dosed and e G E(r) with T(e) G 
A and u / r(e). Then lp(e) G A. 

Proof. It follows from the third axiom in the definition of closed that e G A. 
Therefore, [u, z.(e)] C A by the second axiom and hence lp(e) = [T(e), i{e)]e C 
A as i{e) <T T(e) <t u. □ 

Next we show that taking the closure preserves strong connectivity. 

Proposition 2.56. Suppose that A C is strongly connected and u G A. 
Then {A,u) is strongly connected. 

Proof. Let C be the strong component of u in (A,n). Then A C C since 
u € A and A is strongly connected. Thus it suffices to show that (C, u) 
is closed. The first axiom is clear. If u G C, choose a path p: v ^ u in 
C. Then [u, v] C (A, u) and so the existence of the circuit [u, v]p shows 
that [u,v] C C. Suppose that e G E(r) with T(e) G C and u / T{e). 
Then e G (A,n). Therefore, lp(e) C (A,n) by Proposition 12.55) and hence 
lp(e) C C. We conclude e G C. This completes the proof that {C,u) is 
closed. It now follows that A = C. □ 

In order to show that certain automata are trim, we need to show that 
closure preserves the property that some vertex is reachable from all vertices. 

Proposition 2.57. Suppose that (A, ti) C and w is a vertex of A which 
is accessible from every vertex of A by a path in A. Then there is a directed 
path in A from every vertex of A to w. 

Proof. Let A be the full subgraph of A containing all vertices w of A such 
that there is a path from t; to w in A. We claim that (A, u) is a closed 
subgraph containing A. It will then follow that A = A, as required. Indeed, 
by hypothesis mgACACACu''''. In particular, the first condition in 
the definition of a closed subgraph is satisfied. If u G A, then there is a 
directed path from v to in A. Since [u,v] C A, it follows immediately 
that [it, v] C A. Finally if e G E(r) and n / T(e) G A, then e G A since A is 
closed. Hence if p is a path in A from r(e) to w, then ep is a directed path 
in A from i(e) to vu. Thus t(e) G A and so e G A. This concludes the proof 
that (A, u) is closed. □ 

We can now define the cut sloop of a sloop. 

Definition 2.58 (Cut sloop). If e G E(r), then we define the cut sloop of 
e, denoted cut(e), to be cut((lp(e), r(e))). 
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This definition makes sense because lp(e) is a subgraph of r(e)'^. Putting 
together Propositions 12.501 and I2.54| we conclude that (cut(e), r(e)) has the 
unique simple path property. 

Proposition 2.59. Let e € E(r). Then (cut(e), r(e)) has the unique simple 
path property. 

It turns out that cut(e) can be described more explicitly. 

Proposition 2.60. Let e € E(r) and set u = T(e). Let Ku be the strong 
component ofu in the graph T' obtained from by removing the bold arrows 
ending at u other than e (so C C^ji ). Then: 

(1) (lp(e),^z) C Ku; 

(2) e is the unique edge of (lp(e),n) ending at u; 

(3) If f e E(r) and f belongs to {\p{e),u), then f <C e; 

(4) // / G E(r) and t(/) G (lp(e),n) \ {u}, then lp(/ ) C (lp(e),^x); 

(5) Suppose that f G E(r) n Ku- Then f G (lp(e),u); 

(6) Ku = {W),u). 

Proof. To prove (1), clearly lp(e) C Ku- It therefore suffices to show that 
{Ku,u) is closed. The first two axioms are clear. Suppose that / G E(r) 
and u 7^ t(/) G Ku- Then /.(/) <t T{f) <t u where the first inequality 
holds since (F, v) has the unique simple path property, whereas the second 
follows by definition of u^. Thus / G P'. Also / G Ku since bold arrows are 
never transition edges in P' and t(/) G Ku- This completes the proof of (1). 

We immediately deduce (2) from (1) since P' contains only the edge e 
coming into u as any edge of going into u must necessarily be a bold 
arrow. 

By (1) and (2), necessary conditions for a bold arrow / to belong to 
(lp(e),u) are that C^(/) = Cu and r(/) ^ u. Also by definition of a closed 
set, we must have that r(/) <t u. But this implies f <^ e. 

Statement (4) is just a restatement of Proposition[225]in this special case. 

For (5), if / = e, there is nothing to prove. So suppose f ^ e. By 
definition of Ku we must then have r(/) <t u. We prove that / G (lp(e), u) 
by induction on the number of vertices of the form t(/') with /' G E(F) 
and t(/) <t T{f') <t u. Suppose first that there is no vertex w with 
''"(/) <T w <T u that is the tip of a bold arrow. We claim that r(/) G lp(e). 
Indeed, choose a simple path p from t(/) to i(e) in F' (we can do this 
since / G Ku). Consider the path [u, r(/)]p. If this path is contained in T, 
then it equals [u, i(e)] and so r(/) G lp(e). Otherwise, there is a first bold 
arrow /' used by p. Since p is simple, we must have r(/) <t t(/') <t u. 
See Figure 12.91 Now u ^ T{f') by definition of F'. This contradicts our 
assumption on /, so t(/) G lp(e) and hence / G (lp(e),n) by definition of a 
closed subgraph with respect to u. 

Now we proceed by induction. Choose a simple path p from t(/) to t(e) 
in F'. If the path [u, t(/)]p is contained in T, then it equals [u, t(e)] and so 
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Figure 2.9. Factoring p = xf'y 



T{f) G lp(e) and so / € (lp(e),u) by definition of a closed subgraph with 
respect to u. Otherwise, there is a first bold arrow /' used by p. Again, 
because p is simple, we must have r(/) <t T{f') <t u; see Figure \2M Also 
the definition of F' implies u ^ T{f'). Thus t(/) <t' ''"(/') <t u and so by 
induction /' G (lp(e),n). Therefore lp(/') C (\p(e),u) by (4). Suppose that 
p = xf'y. Then lp(/')^Jr(/0, r(/)]x/' and so r(/) G lp(/') C (lp(e),^z). 
We conclude that / G (lp(e),u) by the definition of a closed subgraph with 
respect to u. 

To establish the final statement, (1) implies (lp(e), ti) C Ku- On the other 
hand. Proposition 12.441 shows that = U/gE(r')ni<:„ 'P(/)- i^) (^) 
imply that lp(/) C (lp(e),u) for ah / G E(F') D Ku- This completes the 
proof. □ 

It follows from the proposition that when forming cut((lp(e), n)) we adjoin 
a single new vertex {u,e), which we normally denote simply by u' . 

Proposition 2.61. Every edge of cut (e) is on a path from u to u' . Conse- 
quently, (cut(e),n) is linear. 

Proof. The previous proposition implies (lp(e),ti) is strongly connected and 
hence every edge / of (lp(e),n) is on a circuit q starting at u. Since e is the 
only edge entering n, the last edge of q must be e. We may assume without 
loss of generality that q uses the edge e only once. Then q corresponds to a 
path in cut((lp(e), n)) from u to u' using /. 

By Proposition 12.591 (cut(e), r(e)) has the unique simple path property. 
It now follows from the first statement and Proposition 12.40] that cut(e) is 
linear. □ 

A key property of (cut(e), n) is that it has strictly fewer bold arrows than 
(r, /) since e no longer is a bold arrow. This is important for inductive 
constructions. 

If u, V are vertices with v <t u, then [u, v] C and so we can define 
{[u,v],u). This will be used later. 

2.7. The McCammond cover of a graph. In this subsection, we present 
a construction, due to the first author, which the second author terms the 
"McCammond" expansion in the context of semigroups. In the setting of 
graphs, the term "cover" seems more appropriate and so we will adhere to 
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this terminology in this section, whereas in the context of semigroups and 
automata we shall use the term "expansion." 

If (r,/) is a rooted graph, let Simple(r,/) be the set of simple paths of 
r starting at / (including the empty path) . If p is a simple path in a graph 
and f is a vertex visited by p, denote by p[v] the unique initial segment oi p 
ending at v. Suppose that 93 : (F', /') (F, /) is a directed covering of rooted 
graphs. Then we obtain an injective map (p: Simple(r,/) Simple(r', /') 
by ^(p) = P where p is the unique lift of p starting at /'. Indeed, the lift of 
a simple path must be simple. It is then natural to focus our attention on 
directed covers if where ip is bijection. 

Proposition 2.62. Let ip: {T',I') (^,1) be a directed cover. Then 
(p: Simple(r,I) — > Simple(r', /') is a bijection if and only if ip takes sim- 
ple paths at I' to simple paths at I. 

Proof. Suppose first that ^ is a bijection and let g be a simple path at I' . 
Then q = (p{p) for some path p G Simple(r, /). But then ip{q) = p is simple. 
Conversely, suppose ip takes simple paths at /' to simple paths at /. Let 
q G Simple(r', /'); then ip{q) G Simple(r, /) and q = ip{ip{q)). This completes 
the proof. □ 

Definition 2.63 (Simple covering). A morphism ip: (T',!') — >■ (T,I) of 
rooted graphs will be called a simple covering if it is a directed covering 
satisfying the equivalent conditions of Proposition I2.62[ 

It turns out that each rooted graph (F, /) has a universal simple cover, 
which is characterized by having the unique simple path property. This cover 
is called the McCammond cover of (T,/). First we establish its universal 
property (and hence uniqueness). Then we provide its construction. 

Proposition 2.64. Let a: (F,I) — >■ (F,/) he a simple covering such that 
(F, /) has the unique simple path property. Then given any simple cover- 
ing ip: (F',/') (F,/), there is a unique morphism (necessarily a simple 
covering) ip: (F,/) (F',/') so that the diagram 



commutes. 

Proof. Define ip as follows. On vertices, set ip{v) = r(^(a([/, w ]))); this 
makes sense since q is a simple covering. For e G E{T) with e: v ^ w, 
define ip{e) to be the unique lift e of a(e) starting at Tp{v). The unique- 
ness and existence of such an edge follows since is a directed cover and 
ip{T{ip{a{[I ,v]))) = T{a{[I ,v])) = a{v). We need to check that is a 



(r,/) 



(FMO 




(r,/) 
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graph morphism. By construction 'ijj{v) = L{ip{e)). The issue is to prove 
tlj(w) = T{tp{e)) = r(e). 

The proof divides into two cases. Suppose first that [I, v]e is a simple path 
(and hence is [/, tt;]). Then ip{w) = t((^(q([/, tt;]))) = T(^(a([/, f])e)) = 
T{ip{e)) since ip{a{[I , v])e Hits a{[I,v]e) = a{[I,w]). 

Next suppose that [I,v]e is not simple. Then [I,w] is an initial segment 
of [I,v]. It follows that (f{a{[I,w])) is an initial segment of ip{a{[I,v])) since 
path lifting preserves initial segments. Next observe that (p{a[I ,v])e is not 
simple as is a simple covering and <p{i^{a[I ,v])'e) = a{[I,v]e), which is 
not simple. Hence ip{w) = r(e) is a vertex w' of ip{a{[I,v])) with f{w') = 
a{w). But since cp{a{[I,v])) is simple and is a simple covering, there is 
exactly one vertex w' of (f{a[I,v]) lying over a{w). Since (f{a{[I , w])) is an 
initial segment of (f{a[I,v]), this w' must be T{Lp{a{[I ,w]))) = tp{w). This 
completes the proof that is a graph morphism. Since the above diagram 
commutes, ijj must automatically be a simple covering. 

If /3 is another morphism making the diagram commute, then it to must 
be a simple covering. Thus /3{[I,v]) is a simple path from /' to and its 
image under is the simple path a{[I,v]). Thus ip{a{[I ,v])) = I3{[l,v]) and 
so (5{v) = i){v). If e is an edge at v, then /3(e) must be the unique lift of 
a(e) at P{v) = ip{v). This proves uniqueness. □ 

The existence of the McCammond cover (or universal simple cover) is 
established via the same schema as the construction of the universal covering 
space in algebraic topology. Abusively, in what follows we shall also use / 

to denote the empty path at I. 

Definition 2.65 (McCammond cover). Let (F, /) be a rooted graph. Define 

(r, I)'^" = (r^'', /) to be the graph given by: 

F(r*^') = Simple(r,/); 

E{l^^) = {(P, e) I p G Simple(r, /), e G E{r), rip) = i{e)} 
where the incidence functions are defined by i{p, e) = p and 



\ pe pe £ Simple(r, /) 
[p[r(e)] else. 



t{p, e) 

Define rj: (T'^'',I) — ^ (T,/) by r]{p) = t{p) on vertices and r]{p,e) = e on 
edges. The map rj is called the McCammond covering of T. The graph T^" 
is termed the McCammond expansion or McCammond cover of F. 

Since any directed cover of an ^-automaton is an ^-automaton, it follows 
that if [si/, I) is a pointed ^-automaton, then {s^^I)^" is a pointed A- 
automaton. 
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Proposition 2.66. The map rj: (T ,1) — > (r, /) is a simple covering and 
^pM= ^ f^^g unique simple path property. Hence it is the universal simple 
covering of (F, 7). 

Proof. Let us verify that the map 77 is a graph morphism. First we check 
that u{rj{p,e)) = i(e) = t{p) = rj{i{p,e)). Second, observe that T{T{p,e)) is 
a simple path ending at T(e) in either of the two cases in the definition of r. 
Thus r]{T{p, e)) = t{t{p, e)) = T(e) = T7]{p, e). Thus is a graph morphism. 
It is a directed covering since if e S -E'(r) is an edge with z.(e) = ri{p) = T{p), 
then (p, e) is the unique edge at p mapping to e under 77. 

The unique Uft of the empty path at I to V^" is the empty path at I. If 
p G Simple(r, I) is non-empty, say p = ei • • • e„, then 

V{p) = {I, ei)(ei, 6162) • • • (ei • • • e„_i, e„), 

which is a simple path in (F^", /) from / to p. Hence (F^'', /) is rooted at /. 
To prove that 77 is simple and (F*^", /) has the unique simple path property 
it therefore suffices to prove that the paths rj{p) with p G Simple(F,/) are 
the only simple paths at / in F^". 

Let q G Simple(F^'', /). We prove by induction on \q\ that q = rj{p) for 
some p € Simple(F,I). If \q\ = 0, then q = rj{I). Suppose now that it 
is true for paths of smaller length than \q\. Then q = rj{p){p,e) for some 
simple path p G Simple(F,/) by induction. Suppose that pe ^ Simple(F,/). 
Then T{p,e) = p[T{e)]. But every initial segment of p is visited by r/(p), 
contradicting that q is simple. We conclude that T{p,e) = pe £ Simple(F,I) 
and so g = rj{pe). □ 

As immediate consequences of the above result and the universal property 
we obtain the following corollaries. 

Corollary 2.67. (T^",!) is the unique simple cover of (F,7) with the 

unique simple path property. 

Corollary 2.68. Ifip: (F,/) (F',/') is a simple covering, then {T,I)^" = 

(F^/')^^ 

2.8. McCammond expansion of automata. As mentioned earlier, the 
McCammond cover of an ^-automaton is again an A-automaton. In this 
context we call it the McCammond expansion of the automaton. However, 
the reader is cautioned that the construction is not functorial. Let's make 
the McCammond expansion explicit from an automaton point-of-vicw. 

Remark 2.69 (Words and simple paths). Notice that when ^ is a pointed 
A-automaton, the words readable starting at a vertex p are in one-to-one 
correspondence with the paths starting at p. Thus, if we let Simple(=e/) be 
the words readable along simple paths starting at the root of , then =2/ 
has the unique simple path property from its root if and only if the paths 
corresponding to these words all end at distinct vertices of ^. 

Next, we interpret the action of A on the simple paths starting at a vertex. 
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Definition 2.70 (Acting on simple paths). Let ^ be a pointed ^d-automa- 
ton with root p. If u is a word readable as a simple path in £/ from p to q 
(i.e., u G Simple(i2/)) and a is the label on an edge in £/ which starts at q, 
then there is a well-defined word in Simple(=e/) that we denote u ■ a. There 
are two possibilities. If the word ua is readable as a simple path starting at 
p, then u ■ a is defined to be ua; otherwise, the path starting at p reading 
ua ends at a vertex q' already traversed by the path reading u. In that 
case, n • a is defined to be the initial portion of u which is read from p to 
q'. The two possibilities are illustrated in Figure 12.101 When u ■ a = ua 
we call this an extension, and when n • a is an initial subsegment of u we 
call it a reduction. Combining these results gives a partial function from 
Simple(j2/) X ^ to Simple(=e/). Notice that this partial function is a function 
if and only if is a complete automaton. 



Figure 2.10. Multiplying a simple path by an edge. 

We are now ready to rephrase the McCammond cover for automata. 

Definition 2.71 (Expanding automata). Let £/ be an j4-automaton rooted 
at /. Then the A-automaton ^Z'^^" is called the McCammmond expansion 



Figure 2.11. The Cayley graph of the Klein four-group. 

Example 2.72 (Klein four-group). Let G = Z2 x Z2 be the Klein four-group 
with the standard presentation (a, 6 \ a^ = iP' = [a, 6] = 1). The Cay- 
ley graph of G with respect to A = {o, 6} is shown in Figure 12.111 In 
Figure 12.121 the expanded graph Cay(G, A)^^" is shown on the left and its 
directed spanning tree is shown on the right. Notice that the expansion has 
the unique simple path property from 1 but not from the state aha since 
aba ■ b = aba ■ aba. Notice that all edges not in the spanning tree connect a 
vertex p to some vertex on the unique simple path in the tree from 1 to p. 




a 



of £/. 



Here is an example of the McCammond expansion. 



b 
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Figure 2.12. The expansion of the graph in Figure [2?TT] and 
its underlying tree. 

Our next example shows that the McCammond expansion is not functo- 
rial. 

Example 2.73 (The McCammond expansion is not functorial). We need to 
include an example. 

We conclude this section with a conjecture that has a long history dating 
back to the 1940s. We mention it here since its assertion can be reformulated 
in terms of depth of the tree underlying the expansion to the Cayley 
graph of a finite group. 

Conjecture 1 (Hamiltonian path). If G is a finite group with at least 3 
elements and it is generated as a monoid by A, then there exists a simple 
path in the Cayley graph of G with respect to A which passes through every 
vertex. In other words, there is a word w € A'^ so that every element of G 
is represented uniquely by an initial segment of w. 

3. Automata from a universal algebra point-of-view 

There is another viewpoint on A-automata that takes them to be universal 
algebras [9]. In what follows we write unary operations on the right of their 
arguments. First let's consider deterministic A-automata. Deterministic A- 
automata are exactly universal algebras with signature (or type) consisting 
of a collection of unary operations indexed by A. Given an ^-automaton 
(r, t) we can create an algebra with underlying set T^(r); the unary operation 
associated to a € A is g i->- qa. Conversely, if Q is a universal algebra of 
this type, we can form its Cayley graph with vertex set Q and set of edges 
Q X A. Here t(g, a) = q and r(g, a) = qa. The labeling sends {q, a) — )• a. 
Therefore, one has universal algebraic notions of generators, relations and 
homomorphisms for automata. We shall see shortly how they relate to our 
topological notions. 
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One can similarly deal with partial deterministic automata by formally 
adjoining a sink. Let us now consider universal algebras with unary opera- 
tions indexed by A and a distinguished constant □. We also demand that 
they satisfy the identity Da = □ for all a £ A. Let QJ be the variety of al- 
gebras of this type satisfying these identities. Then, given an A-automaton 
{T,£) we can define an algebra in this type by taking V(T) U {□} as the 
underlying set, defining Da = □ for all a € j4 and setting 



Conversely, given an algebra Q G QJ, we can define its Cayley graph by taking 
the vertex set to be (5\{n} and the edge set to be {(g, a) £ QxA\ qa ^ □}. 
Of course, i{q, a) = q, T{q, a) = qa and £{q, a) = a. Thus we can also talk 
about generators, relations and homomorphisms for partial automata. 

Let us reinterpret these notions topologically. We say that a morphism if 
of QJ-algebras is d-restricted if 99~^(n) = {□}. 

Proposition 3.1. 

(1) The category of algebras in 2J with surjective O-restricted morphisms 
is equivalent to the category of A-automata with directed covers as 
morphisms. 

(2) An A-automaton (L, i) is generated by I in the sense of universal 
algebra if and only if \ is a generating set in the sense of Defini- 
tion \2.21[ In particular, rooted automata correspond to one-genera- 
ted ^-algebras. 

(3) The free algebra on a set I is (I Xj4*)U{n} with the obvious ^-algebra 
structure. 

Proof. To prove (1), suppose first that 99 : =e/i — )• =2/2 is a directed covering of 
^-automata and define ip: V{£/i) U {□} ¥{£^2) U {□} by i){U) = □ and 
ipiv) = ip{v) for V S V{£/i). Clearly, ijj is surjective and D-restricted. Let 
us check that ip is a homomorphism. Clearly ■ a) = ■(/'(D) = □ = □ • a 
for all a G ^. Next suppose q € V{£/i) and a G A. Then by definition of 
a directed covering, we have that ip{qa) = (p{q)a in the sense that either 
both sides are undefined or both are defined and agreed. If both sides are 
undefined, then ip{qa) = □ = ip{q)a. Otherwise, tp{qa) = '4'{q)a. 

Conversely, suppose that (p: Q U {□} Q' \J {□} is a surjective □- 
restricted morphism of 23-algebras. Then we claim that 99 induces a directed 
cover defined by (p on vertices and by {q, a) i-)- {ip{q),a), for qa ^ □, on edges. 
Notice this is well defined since qa ^ O implies ip{q)a = f{qa) ^ □ as <^ is 
□-restricted. Evidentally, if gives a morphism of ^-automata. Surjectivity 
on vertices is immediate. Suppose (p{q)a / □. Then ip{qa) = (p{q)a ^ □ 
and hence qa ID since ip is a homomorphism. Thus the edge {ip{q),a) is a 
lift (g, a) (and is the unique one). 

The second item is trivial. To verify (3), if Q is a 2J-algebra generated 
by I, then the map defined by {p,w) H- pw for ip,w) € I x A* and □ H- □ 



qa = 




qa defined 
otherwise. 
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is the unique morphism from \ x A* to Q extending the map (p, 1) p for 
pe\. □ 

We leave to the reader to formulate and prove the analogous result for 
deterministic automata. 

3.1. Presentations and rewriting. It now makes sense to talk about pre- 
sentations of A-automata. Since we are only interested in pointed automata, 
we restrict to the case of one-generated algebras from 53 and so we need only 
give the relations. It follows now that if {£/, I) is a pointed automaton, then 
it is one-generated and can be defined by relations of the form u = v and 
u = n\ where u,v (z A*. The relation u = v forces lu = Iv in £/, while 
the relation u = O forces lu to be undefined. If {R) is a presentation of 
£/, the underlying 5J-algebra of £/ is the quotient of A* U {□} (viewed as 
a right ^*-set) by the right congruence generated by the relations in R and 
the relations Da = a. Formally, let us define the one-step derivation relation 
by putting Da □ for all a € ^ and uw vw ii u = v £ R where 
u,v G A* Li {□}, w € A*. As usual, =>* denotes the reflexive transitive 
closure of the relation =^. We write u ^ v for u,v £ A* U {□} if there is a 
sequence u = uo,ui • • • ,Un = v such that, for each i = 0, . . . , n — 1, either 
Ui Uj+i or =^ Ui. Then the state set of .s/ is {A* U {□})/~. The root 
is the class I of the empty word. We remark that we are essentially dealing 
with what is known as prefix rewriting systems [7]: relations can only be 
applied to prefixes of words. 
We state an obvious result. 

Proposition 3.2. Let S = {A \ R) be a presentation of an A-semigroup. 
Then the rooted Cayley graph (Cay{S, A), I) is presented as an A-automaton 
by {{ur = ur' \ u £ A* ,r = r' £ R})- 

Definition 3.3 (Loop automaton). A pointed automaton (^,1) is said to 
be a loop automaton if it has a presentation of the form {L U D) where all 
relations in L are so-called loop relations of the form uv = u and all relations 
in D are dead-end relations = □ where u,v,w € A*. 

We now prove that automata with the unique simple path property are 
loop automata. 

Proposition 3.4. Let I) be a partial A-automaton with the unique sim- 
ple path property. Then 

i/ = ({£(slp(e)) =£([/, r(e)]) I eGE(r)}U 

{£{[L,v])a = D \ va is undefined,v € V(T),a G A}) 

where £: E{T) A is the labeling function. Thus (^,/) is a loop automa- 
ton. 

Proof. Clearly jz/ satisfies all these relations. We show all other relations 
are consequences of these. Let ~ be the equivalence relation on A* U {□} 
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associated to the presentation above. First we claim that if lu is defined 
for u A*, then u ~ ^{[I, lu])- The proof goes by induction on the number 
of bold arrows in the path pu read by u from / to lu. If there are no bold 
arrows, then pu = [I, lu] and there is nothing to prove. Else we can factor 
Pu = vew where e is the first bold arrow used hy pu- If q = t{v), then 
V = [I,q] and slp(e) = [I,q]e = ve. See Figure O Thus (.{ve) ~ ^([/,r(e)]) 



e 




Figure 3.1. Reducing to the base 



and so M = l{yew) ~ ,T{e)\)i{w) = ,T{e)]w). But [/, T(e)]t(; contains 
strictly fewer bold arrows than pu and hence by induction u ~ ^{[I-, '''{&)]uj) ~ 

1{[I,T{W)])=1{[IJU]). 

Suppose now that u,v ^ A* and lu = Iv with both sides defined. Then by 
the claim u ~ ^([/, lu]) = Iv]) ~ u . \iu ^ A* and lu is undefined, then 
we can factor u = vaw uniquely with Iv defined, a A and Iva undefined. 
Then by the claim v ~ i{[I,Iv]). Hence u ~ £{[I , Iv])aw ~ □ using the 
relation £{[I,Iv])a = □. This completes the proof that £/ is given by the 
above presentation. □ 

The above proof provides a Church-Rosser type property [7] for pointed 
automata with the unique simple path property. Namely, we have shown 
that u ~ □ or ti ~ £([/, lu]) can be derived using only . Hence if we form 
a rewriting system with rules 

£(slp(e)) ^^([/,T(e)]) foreeE(r), 
i{[I,v])a n\ ii va is undefined, 

□a ^ □ 

then we obtain a confiuent terminating prefix rewriting system (where der- 
ivations are defined via =>). The unique irreducible form derivable from 

G A* is □ if Iw is not defined and otherwise is i{[I, Iw]), which from now 
on we denote Red(t(;). Here we consider □ to be irreducible, whereas a word 
w A* is irreducible if it has no prefix which is the left hand side of a rule. 

Let us formalize this. 

Definition 3.5 (Elementary loop automaton). A loop automaton £/ is said 
to be an elementary loop automaton if it can be given a loop automaton 
presentation {L U D) with the property that distinct irreducible elements for 
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the prefix rewriting system 



uv ^ u 



such that uv = u ^ L 



□a ^ □ 



such that « = 

A 



represent distinct classes of ~. 

It turns out that elementary loop automata are exactly the automata with 
unique simple path property. 

Proposition 3.6. A pointed A-automaton (si/, I) has the unique simple 
path property if and only if it is an elementary loop automaton. 

Proof. The proof of Proposition 13.41 shows that if I) has the unique 
simple path property, then it is an elementary loop automaton with the 
presentation in that proposition. We prove the converse. Suppose {£/, I) 
is an elementary loop automaton with loop presentation (L U D) and let 
N be the set of irreducible elements of A*; notice that it is prefix-closed. 
Since each element of gives a unique state of ^ and is prefix-closed, it 
follows that subgraph of spanned by the paths labeled by the elements 
of N from / is a directed rooted tree isomorphic to the induced subgraph of 
the Cay ley graph of A* with vertices the elements of N. Moreover, it must 
be a spanning tree since if g is a state of then we can find a word w (z A* 
with Iw = q. Replacing w by the unique irreducible element to which it can 
be reduced shows that T is a spanning tree. We claim that each edge e of 
£/ that does not belong to T satisfies r(e) G [I, t(e)]. Proposition 12.321 will 
then imply that I) has the unique simple path property. So suppose 
u £ N, ua ^ N with a € A and lua defined. Since ua is not irreducible 
and cannot be reduced to □, we can write ua = xyw where xy = x £ L. 
Since u is irreducible, we must have that w is empty and y ends in a. That 
is, u = xy' where y = y'a and ua = xy. But then ua = xy ^ x and x is a 
prefix of u. Thus if e is the edge labeled by a at lu, then T(e) = Ix, which 
is a vertex of [I,Iu]. This completes the proof. □ 

3.2. The standard Kleene expression. We now aim to define a unionless 
Kleene expression for the language accepted by a finite trim pointed acceptor 
(i2/, /, {q}) with the unique simple path property and a single terminal state 
q. This expression turns out to be good for pumping arguments and has 
the advantage of being unionless. The intuition is that for applications to 
complexity, one wants to associate to the automaton a unionless Kleene 
expression and then replace each occurrence of the Kleene star * by cj in 
order to obtain an element of the free aperiodic cj-algebra {25J. 

Let = (r,£). We assume that a geometric rank function r on has 
been fixed. It follows from Proposition 12.401 and {£/,!, {q}) being trim that 
£/ is linear. Clearly the vertex q belongs to the bottomost strong component 
and / belongs to the topmost one. The Kleene expression IC{s^ ,I,q,r) is 
defined via the Principal of Induction (or PI) , which is based on induction 
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on the pair (|E(r)j, |y(r)j) where N x N is ordered lexicographicahy. Let 
Co > Ci > • • • > Cn be the chain of strong components of T and let pi and 
Qi be the respective entry and exit points of Ci where we take po = I. Let 
Cj : Qi-i Pi, for i = 1, ... , n, be the i^^ transition edge. See Figure 13.21 
Notice that each {Ci,pi) has the unique simple path property. It is possible 




Pn qn 



Figure 3.2. Trim automaton with the unique simple path property 
that Pi = Qi and some of the Ci may be trivial. 

3.2.1. Base of the induction. If has no bold arrows, then it consists 
simply of the geodesic [/, q] and we define 

}C{£/,I,q,r)=e{[I,q]) (3.1) 

where i is the function labeling the edges of £/. 

3.2.2. Elementary induction. If n > 1, then we define 

)€{£/, I, q,r) = K. (i2/(g„_i),/,g„_i,r|_<^(g^_^)) ^(e„)/C (C„,p„, g, r|c„) 

(3.2) 

where J2/(g„_i) is the full subautomaton of £^ whose vertices are greater 
than or equal to qn-i in the accessibility order (cf. Proposition 12.40]) . This 
make sense since (j2/(g„_i), /) and {Cn,Pn) have the unique simple path 
property and each have strictly fewer vertices than £/ and no more bold 
arrows. 

3.2.3. Loop case. Suppose that n = (so there is only one strong compo- 
nent) and that ^ has at least one bold arrow. Let e be the bold arrow 
ending at I with largest geometric rank. Let ^ be the automaton obtained 
from by removing the edge e and taking the strong component of /. 
Then I) has the unique simple path property and has fewer bold ar- 
rows then £/. Also cut(e) has the unique simple path property from / by 
Propositions 12.501 and is linear by Proposition 12.61] Finally, we observe that 
([/, g],/) has the unique simple path property. Moreover, every vertex of 
{[I,q],I) is on a path from I to q hy Proposition 12.571 Thus {[I,q],I) is 
trim. It also has fewer bold arrows than since the graph obtained by re- 
moving the bold arrows ending at / is closed with respect to / and contains 
[I,q]. Thus we may define 

JC{^, I, q,r) = IC (^, /, I,ry) [/C (cut(e), /, l',rU(e)) ^ (^, I, I, A^s)] * 

(3.3) 

where r|(,ut(e) makes sense as each bold arrow of cut(e) is a bold arrow of 
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Before verifying that the Kleene expression yields the correct language, we 
compute some examples. In the first few examples there will be no multiple 
bold arrows ending at the same vertex and so we do not need to give the 
geometric rank function. 

Example 3.7. Suppose the automaton ^ is given by 

a 

A O 
I >Q-^ 

Then IC{^,I,q,r) = Aa*. 

Example 3.8. Suppose the automaton ^ is given by 

I —^p''^'^ q 

oil 

Then /C(^,/,g,r) = A{aia2)*ai. 

Example 3.9. Let the automaton ^ be given by 




Then 

cut(c) = q • — ^ • q' 

d 

A computation similar to the previous example shows that 

IC{cut{c),q,q' ,r) = a{bd)*bc. 

Thus JC{^,I,q,r) = A{a{bd)*bc)* . 

Our next example modifies the previous one by changing the terminal 
state q. 

Example 3.10. Suppose the automaton is given by 




The only new ingredient is to compute {\p,q],p), which is 
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and so has Kleene expression 



[{[p,Q],p),P,q,rj =a{bd)*b. 
Thus IC{^,I,q,r) = A{a{bd)*bc)*a{bd)*b. 

In our next example, we will make use of the geometric rank function. 
Example 3.11. Let the automaton ^ be given by 







and let us suppose that /3 has greater geometric rank than a. Then the 
Kleene expression is K(^,I,q,r) = Aa*(/3a*)*. 

A slight variation on the previous example is the following. 

Example 3.12. Let the automaton ^ be given by 




and let us suppose that P has greater geometric rank than 0:2 • Then 

7 

o 

cut(a2) = 1 > • > q 

and so K{£/,I,q,r) = A(ai7*a2)* [/?(ai7*a2)*]*. 

Our final example mixes several ingredients. 
Example 3.13. Suppose the automaton J2f is given by 






and let us suppose that 0:3 has greater geometric rank than (3. Then 

7 

CUt(Q:3) = P > • > q > p' 
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and 



7 



{[p,q],p) = P 



n 



Thus K{£/,I,q,r) = A/3* [{aij* 0203) P*]* (^11*0^2 ■ 

We now prove that the Kleene expression accepts the language recognized 
by the automaton. 

Theorem 3.14. Let {£/,I,{q}) be a finite trim pointed A-acceptor with 
the unique simple path property and with a single terminal state q where 
£/ = (r,^). Fix a geometric rank function r on Then IC{^,I,q,r) 

is a Kleene expression for the language of words accepted by with initial 
state I and terminal state q that does not use union. 

Proof. The proof goes by induction on the pair (|E(r)|, |\^(r)|) where we 
order N x N lexicographically. It is clear that if has no bold arrows, 
then IC{s3f, I, q, r) is unionless and accepts the same language as £/. Simi- 
larly, in the case of the elementary induction 13.2.21 one easily verifies that 
/C(i2/, /, q, r) is a unionless Kleene expression accepting the language of ^ 
using the induction hypothesis. The only difficult case is the loop case 13.2.31 

Let us retain the notation of 13.2.31 By induction we have ^ with initial 
and terminal state both set to / accepts precisely /C(=^, /, /, r|g), whereas 
/C (cut(e), I, r|cut(e)) accepts all strings reading from I to /' in cut(e) and 
/C(([/, g], /), /, g, rj^^y^j-^) accepts all words reading in {[I,q],I) from I to q 
Then trivially, JC{^, I, q, r) is accepted by ^ with initial state / and final 
state q and is a unionless Kleene expression using the induction and ()3.3p . 
The converse direction requires some proof. 

Suppose w labels a path from / to q. We can factor w = uv with u,v G A* 
so that In = I and the path labeled by v from I to q does not revisit /. First 
we claim that v is accepted by ([/, g],/) with initial state / and terminal 
state q. If g = /, then v is empty and there is nothing to prove. Otherwise, 
we proceed by induction on the number of applications of needed to 
reduce v to l{[I,q]) in the rewriting system appearing after Proposition 13.41 
U V = £{[I,q]), then there is nothing to prove. Suppose v ^ x. Then we 
can write v = ^(slp(/))z for some bold arrow / with x = i{[I,T{f)])z. By 
choice of v, the edge / does not end at /. By induction, x is accepted 
by ([/, g],/). Since / is a bold arrow with t(/) visited by x on its run 
from /, it follows from the definition of being closed with respect to / that 



/ G {[I,q],I). Thus lp( /) C {[I ,q],I) by Proposition Ther efore, 
slp(/) = [I,T{f)]\p{f) C {[I,q],I) and so v is accepted by {[I,q],I) with 




It thus remains to show that 



u G /C(=^,/, 



/,r|^) [}C (cut(e),/,/' 



r\cut(e)) K^{^,I,I,ry)] 



38 



J. MCCAMMOND, J. RHODES, AND B. STEINBERG 



Let s be the path read by u from I to I and factor p = soesie ■ ■ ■ Sm-ieSm 
where no Si uses the edge e and where we interpret the case m = as 
p = sq. Then i{sm) is accepted by S§ with initial and terminal state /. We 
may further factor each Si with 0<i<m — lassi = tiUi so that : I ^ I 
and so that Uii I ^ i{e) does not revisit /. Then £{ti) is accepted by <^ 
with initial and terminal state I for all < z < m — 1. We claim that i{ui)a 
is accepted by cut(e) with intial state I and terminal state I' . If / = t{e) 
(that is, e is a loop edge), then each Ui is empty and cut(e) = 1-^1' and 
so there is nothing to prove. So assume I ^ t{e). We show by induction on 
the number of applications of =^ needed to reduce i{ui) to £{[I, i{e)]) in the 
rewriting system after Proposition 13.41 that i{ui) reads in cut(e) from I to 
t(e). Since a labels an edge from /,(e) to /' in cut(e), this will establish the 
claim. Recall that cut(e) is obtained by cutting (lp(e), /) at /. If Ui = [/, t(e)] 
there is nothing to prove since [/, t(e)] C |p(e) and doesn't use the edge e, 
and hence "is" the geodesic from / to i{e) in cut(e). Thus £{ui) reads from 
/ to z.(e) in cut(e). Next suppose that i{ui) =^ x where x reads a path from 
/ to i(e) in cut(e). By definition of =>, we can write £{ui) = £{s\p{f))z for 
some bold arrow / so that x = £{[I,T{f)])z. Since Ui does not revisit /, it 
follows that / does not end at /. Then since x labels a path in (lp(e),/) 
from / to t(e), it follows by the definition of a closed subgraph at / that 
/ G (lp(e), /) and hence, by Proposition l2.551 we must have lp(/) C (lp(e), /). 
Consequently, slp(/) = [/, T(/)]lp(/) is contained in (lp(e),/). Because Ui 
does not revisit /, it follows that £{ui) = i{5\p{f))z is readable from I to 
i(e) in cut(e). 

Thus we now have (using the inductive hypothesis) 

u = [e{toMuo)a)] ■ ■ ■ [^(t„_i)(^(n„_i)a)]£(s^) 

€ [/C (^, /, I,ry) JC (cut(e), /, r|,,t(e))] IC (^, /, /, r|,^) 

C /C I, I,ry) [/C (cut(e), /, l',rUie)) ^ (■^, h I, Asg)] * 

as required. This completes the proof of the theorem. □ 

4. Semigroup expansions 

4.1. Semigroups. Once again we fix an alphabet A (usually assumed fi- 
nite) . In later sections we often require semigroups to have certain finiteness 
properties such as being finite, or at least finite ^-above. A semigroup S 
is finite ^ -above if there are only finitely many elements in S which are 
^-above (i.e., > jr) any particular element of S. We also need the following 
equivalent characterization. In this paper we allow ideals to be empty. 

Lemma 4.1. A semigroup S is finite ^ -above if and only if there exists a 
family of co-finite ideals whose intersection is empty. 

Proof. If such a family of ideals exists, then for each s € S there exists 
an ideal I in the family with s ^ I. Since 5" \ / is finite and contains all 
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Notation 


Description 


Sgp 


semigroups 


FSgp 


finite semigroups 


FJSgp 


finite ^-above semigroups 


SgPA 


j4-semigroups 


FSgpA 


finite A-semigroups 


FJSgpA 


finite ^-above ^-semigroups 



Table 1. Commonly used categories 



the elements of S that are ^-above s, it follows that S is finite ^-above. 
Conversely, if S is finite then the empty ideal is co-finite and the result is 
immediate. If S is infinite and finite ^-above, then for each ^-class J 
let Ij denote the elements in S which are not ^-above J. Clearly these 
ideals are co-finite and we will show that their intersection is empty. Since 
S is infinite, for each s € S there is an element t which is not ^-above s. 
Because s is not in the ideal corresponding to the ^-class of st, evidentally 
s is not in the intersection of all such ideals and thus the intersection of 
these ideals is empty. □ 

Definition 4.2 (Categories). In Table [T] we list the categories commonly 
used throughout the article and the notations used to describe them. When 
the objects are ^-semigroups, the morphisms are restricted to A-morphisms. 



4.2. Straightline automata. In this subsection we define two finite-state 
automata for each finite ^-above ^-semigroup S and each word w € A~^, 
one which depends on the word w (the straightline automaton) , and a second 
which only depends on the element in S which w represents (the Cayley 
automaton). These two types of automata will be crucial to all of the results 
that follow. 

Straightline automata play a critical role in the solution to the word 
problem for Burnside semigroups [24J. 

Definition 4.3 (Straightline automata). Let {^,1) be a pointed ^-auto- 
maton and w a word in . The straightline automaton of w with respect to 
£/, denoted Str"^ (w), has an underlying directed graph which is a subgraph 
of iz/. The directed graph underlying the automaton Str^ (w) is the union 
of the path from / labeled by w, and the strong components of each of the 
vertices that this path passes through. This labeled directed graph is made 
into an acceptor by specifying the vertex / as its initial state and Iw as its 
terminal state. 

If S is an ^-semigroup and w is a word in A~^, then the straightline au- 
tomaton of w with respect to S is by definition Str'^(i(;) = Str^^'^^^'"^\w). 
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Observe that its vertex set consists of all of the vertices that represent ele- 
ments of which are ^-equivalent to elements of represented by initial 
segments of w. 

Remark 4.4. If /) is a finite pointed linear ^-automaton and w E 
reads from / to a vertex in the bottommost strong component of £/, then 
£/ = Str-^(tf). 

The following properties of Str'^(tt;) follow easily from the definition. 

Lemma 4.5 (Str'^(t(;)). If S is an arbitrary finite ^ -above A-semigroup 
and w is word in , then Str^^w) (with initial state I and terminal state 
[w]s) is a trim, deterministic finite-state acceptor which is linear and its 
strong components are Schiitzenberger graphs of Cay{S, A). In addition, a 
word X G is readable on Str'^{w) starting at the initial state if and only 
if X is ^-equivalent to an initial segment of w. 

The transition edges of the linear graph Str'^(w) are exactly those edges 
whose endpoints represent elements of S which are not .^-equivalent to 
each other, in other words, edges which represent descents in the ^-order. 
Thus every edge in Str'^(t(;) either is a transition edge or it lies in a unique 
Schiitzenberger graph. Let the vertices pi and qi be defined as in Defini- 
tion 12.91 and let Si and ti denote the elements of S which corresponding to 
these vertices. In addition, if Str'^(i(;) has exactly k + 1 strong components, 
we define po to be the initial state of Str'^(?i;) and qk to be its terminal state. 
For i = 0,1, . . . ,k, the vertex pi will be the first vertex in the path w which 
lies in the i^^ strong component and qi will be the last such vertex. Thus, pi 
and Qi can also be identified with specific initial segments of w. Under this 
scheme po = qo corresponds to the empty prefix and qk corresponds to word 
w itself. Notice that the vertices po and qq will always correspond since the 
Cayley graph used to define Str'^(?i;) is that of . 

Example 4.6. Figure 14.11 illustrates the structure of a typical straightline 
automaton. In this example there are A; -|- 1 strong components. The O*'^ 
strong component is trivial, as it always is, and the entry and exit points 
are equal in the third strong component. 




Figure 4.1. A straightline automaton 
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Of course, there is an obvious analogue of Lemma 14.51 for straightline 
automata associated to finite pointed automata. 

Definition 4.7 (Cayley automata). Let S be an A-semigroup and let w 
be a word in A'^ . The Cayley automaton of w with respect to S is the full 
subgraph of Cay{S, A) on the set of vertices which represent elements of 
^-above [w]s- The initial state and terminal state are defined as the vertices 
/ and [w]s, as before. The Cayley automaton of w will be denoted Cay'^(ri;). 
Notice that Cay'^(ti;) actually only depends on the element of S that w 
represents and not on the word itself. Thus, following the same convention 
as for Schiitzenberger graphs, if [w]s = s, we might write Cay'^(s) instead. 

The following lemmas records some elementary properties of Cay'^(u;). 

Lemma 4.8 {Cay^{w)). If S is a finite ^ -above A-semigroup and w is a 
word in A^ , then Cay'^(i(;) (with initial state I and terminal state [w\s) is 
a trim, deterministic, finite-state automaton which accepts the language of 
words equivalent to w in S. In addition, the finite-state automaton Str'^(?i;) 
is a subautomaton ofCay^{w). Moreover, a word x £ A~^ is readable on 
Cay^{vj) if and only if [x]s > ^ [w]s, it is readable on Cay^{w) starting at 
the initial state if and only if [x]s [w]s, and it is readable on Cay^{w) 
ending at [w]s if and only if [x]s >_jf ['w]s- 

Example 4.9. Figure [42] illustrates the structure of a possible Cayley auto- 
maton. Notice that the induced order on strong components is this example 
is not linear, and that transition edges between two particular strong com- 
ponents are not always unique. On the other hand, the automaton Str'^(?i;) 
shown in Figure HTT] is visible as a subautomaton of Cay'^(tt;). 



The situation where Str (tf) and Cay (w) coincide is particularly nice. 

Lemma 4.10 (Equal). Let S be a finite ^ -above A-semigroup and let w 
be a word in A'^ . IfStr'^{w) = Cay^{w), then this trim, deterministic finite 
state acceptor — which depends only on the element w represents — both 
is linear and accepts the language of words equivalent to w in S. In addi- 
tion, every element above [w]s is Si -equivalent to an initial segment ofw. 




Figure 4.2. A Cayley automaton 
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Similarly, if the left-handed versions of Str^ (w) and Cay^ (w) are identical, 
then every element -above [w]s is ^-equivalent to a final segment of w. 

Finally, we record a fact about Schiitzenberger graphs for later use. Recall 
that an element s of a semigroup 5 is regular if s = sts for some t ^ S. For 
instance, idempotents are regular. 

Lemma 4.11 (Sch'^(ti;)). Suppose S is an A-semigroup, w and x are words 
in , and w represents a regular element of S. Then x is readable on 
Sch'^(tt;) if and only if [x]s > j' [w]s- 

Proof. Consider the automaton Cay'^(?i;) and recall that Sch'^(t<;) is the 
strong component of the vertex [w]s in Cay'^(w). If x is readable on Sch'^(t(;), 
then since the graph is strongly connected, there exist words y,z G A* so 
that yxz is readable as a loop starting and ending at [w]s- But this means 
that wyxz is accepted by Cay'^(ti;) and hence [wyxz]s = [w]s- In particular, 
[x]s > J' [Ms- Conversely, if [x]s is ^-above [w]s then there exist y,z G A* 
such that [yxz]s = [w]s- Suppose that [ujrw]s = [w]s with r € using 
regularity of [wjs- Then this means that [u)ryxz]s = [ujrw]s = [w]s- Since 
Cay'^{w) is partial deterministic this forces ryxz to be readable as a loop 
in Sch'^(it;) based at [w]s and, in particular, x is readable in Sch'^(w), as 
[wry]s M [w]s. □ 

4.3. Expansions. In this subsection we review the general notion of an 
expansion and record some of their chief properties. 

Definition 4.12 (Expansions). Let C be a subcategory of Sgp. An ex- 
pansion on C is a functor E: C ^ C together with a surjective natural 
transformation r] from E to the identity functor Ic on C; see [23] for more 
on functors and natural transformations. More explicitly, an expansion as- 
signs to every semigroup S in C a semigroup and a surjective morphism 
r]s : S, and to every morphism / : 5 — )• T in C it assigns a morphism 

f^ : —7- in C so that the diagram 

gE — ^ rpE 



VS 



VT 



f 

commutes. The semigroups and morphisms assigned must remain in C. 
One must assign identity morphisms to identity morphisms and composition 
must be preserved. If remains finite whenever S is finite, we say that E 
preserves finiteness. Suppose that E and F are expansions. The semigroup 
{S^f is denoted S^-^. 

Most of the expansions defined in this article are expansions on a cate- 
gory of ^-semigroups. Notice that such an expansion is uniquely determined 
by the semigroups assigned since ^-morphisms such as f^ : — >■ and 
rjs '■ S^ S are uniquely determined by their domain and range whenever 
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they exist. Thus only the existence of morphisms such as / and rjs need 
to be checked: ah of the remaining functoriahty requirements follow auto- 
matically. We also encounter situations where the objects can be expanded 
but the morphisms cannot. In other words, for every S, there is an object 
and a map -» S, but for maps S — )• T there may or may not exist a 
map — )• T^. In this situation we call S i— )• an object expansion. 

The next lemma essentially shows that an expansion which preserves 
finiteness also preserves the property of being finite ^-above. 

Lemma 4.13 (Preserving finiteness). If E is an expansion on SgpA which 
preserves finiteness and for all semigroups S G SgpA and for all ideals 
I Q S, there is a map {S/I)^ {S^/J) where J = rig^{I), then E applied 
to a finite ^ -above semigroup remains finite ^ -above. 

Proof. Let S be a finite ^-above ^-semigroup and let J-" be a family of 
co-finite ideals in S whose intersection is empty (Lemma 14. ID . Consider the 
inverse images of these ideals under rjs : — >■ S. The intersection of these 
ideals in is clearly empty, and they are also co-finite since for each 1 ^ T ^ 
/ J is the image of the semigroup (S/I)^ which is finite by hypothesis. □ 

For later use we briefly review some of the most common expansions. 

Definition 4.14 (Common expansions). Let S be an ^-semigroup. Roughly 
speaking, the reverse or right Rhodes expansion remembers the exit points 
as the path w travels through Cay^{w), while the reverse or right Karnof sky- 
Rhodes variation remembers the actual transition edges used (i.e., exit point, 
entry point and particular edge traversed) . Note that the reverse Karnofsky- 
Rhodes expansion of S clearly maps onto the reverse Rhodes expansion 
of S. The reverse Rhodes expansion is thus a fairly small expansion in 
which Cay^{w) is quasilinear for all w € A'^ , and the reverse Karnofsky- 
Rhodes expansion, is the smallest expansion such that Cay'^{w) is linear for 
all w G A~^. We denote the reverse Rhodes expansion by and the reverse 
Karnofsky-Rhodes by Ail. 

The original versions of these two expansions were left-handed versions 
which we denote and Ail°^. In other words, the statements above hold 
when we define left-handed versions of Str'^(u;) and Cay^{w) using the left 
Cayley graph and we read words from right-to-left instead of left-to-right. 
The Birget-Rhodes expansion, denoted BR, is the limit ^-semigroup ob- 
tained by iterating the Rhodes and the reverse Rhodes expansions until the 
result stabilizes. All five of these expansions preserve finiteness. See [6] for 
a precise definition of the Birget-Rhodes expansion, and [13], [H] or [M] for 
precise definitions of the Rhodes expansion, the Karnofsky-Rhodes expan- 
sion, and their reverses. 

For the reader's convenience we include here the definition of the right 
Karnofsky-Rhodes expansion (cut to the generating set A) so that the reader 
gets the idea. 
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Definition 4.15 (Karnofsky- Rhodes expansion). If S is an j4-semigroup, 
define a congruence on by putting u = v if: 

(1) [u]s = [v]s; 

(2) The paths read by u and v from / to [u]s = [v]s in Cay(S', vl) use 
the same transition edges. 

The quotient semigroup A'^ /= is denoted 5^^"*". It is immediate from (1) that 
there is a surjective morphism rjs : S^^'^ S. If / : S — ?> T is a homomor- 
phism of ^-semigroups, then f{s) A f{s)a a transition edge of Cay{T,A) 
forces s A- sa to be a transition edge of Cay (5, A). It follows that there 
exists a map J^^"*" : S"^^"*" — T^^^ and so we have defined an expansion on 
the category of ^-semigroup. It is easy to see from (1) and (2) that All 
preserves finiteness. 

Remark 4.16. It is known that the projection r/^: S^^-^ ^ 5 is universal 
amongst maps from ^-semigroups to S with derived semigroupoid dividing 
a locally trivial category [T^IHT]. 

When S = we say that S is stable under the expansion E. The 
following result is an immediate consequence of Definition 14.141 

Lemma 4.17. Let S be a finite ^ -above A-semigroup. If S is stable un- 
der the reverse Rhodes expansion A^, then Str'^(t(;) and C3y^(w) have the 
same vertex set for all words w € A~^ and if S is stable under the reverse 
Karnof sky-Rhodes expansion, Ail, then Str'^(?i;) = Cay^{w) for all words 
w G A~^ and all of the properties listed in Lemma \4-iO\ must hold. Simi- 
lar results hold for the left-handed versions when S is stable under A^ or 

An°p. 

Proof. We just handle the case of the reverse Karnosfky- Rhodes expansion 
as the other case is similar. Since Str'^(ri;) is always a subautomaton of 
Cay'^(ii;), it suffices to show the reverse inclusion. To do this, it suffices by 
the definitions to show that each ^-class of elements .^-above [w]s contains 
an initial segment of w. Suppose that [u]s [w]s- Then [w]s = [uv]s for 
some V £ A* . We can factor u = u'u" so that reading u" from [u']s stays 
in a Schiitzenberger graph, i.e., the last letter of u' labels the last transition 
edge e read by u from / in Cay{S,A). The since S = S^^-^, it follows that 
w uses the same transition edges as uv and so, in particular, uses e. Thus 
w = w'w" where the last edge read by w' from / is e. Then [w']s = [u']s ^ 
[u'u"]s = VAs- This establishes that each .^-class of elements ^-above [w\s 
contains an initial segment of tf, as was required. □ 

For expansions closed under iteration, there is a general stability result. 

Lemma 4.18 (Stability). Let C be a subcategory of SgpA oind let E and 
F be expansions on C. //, for every A-semigroup S in C, S^'^ = and 
there exists a map , then for every S in C, = S^'^ = S^'^ . In 

other words, is stable under F. Conversely, if = S^'^ or = S^'^ , 
then there is a map from — )• . 
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Proof. If we apply the S-expansion to the maps — )• — )• 5" we get 
maps in each direction between and S^'^ as a result of the hypothesis 
that S^'^ = . Within the category of A-semigroups this implies they are 
isomorphic. Similarly there is a map since F is an expansion 

and a map 

gE.E ^ gE.F 

since the hypothesis of the theorem can be applied 
to the instead of S. Once again, maps in both directions implies they are 
isomorphic. The final assertion is easy since S^'^ — >■ is the F-expansion 
applied to map S and S^'^ is the ii^-expansion of semigroup 

projected back to . □ 

We conclude this section by reviewing the notion of a relational morphism, 
its relationship with expansions, and a remark on the most common ways 
to construct expansions. 

Definition 4.19 (Relational morphisms). Let S and T be semigroups and 
recall that a relational morphism from S to T is a subsemigroup R S x T 
such that the natural projection from i? to 5 is onto. If (p: R ^ S and 
tp: R ^ T denote the natural projections restricted to R, then the "rela- 
tion" alluded to in the name relational morphism is the relation o ip~^. A 
relation-based definition of a relational morphism also exists, but the equiv- 
alent definition given above is easier to work with in practice. We thus refer 
to the subsemigroup C S" x T as the relational morphism from S" to T 
even though technically, it merely encodes the corresponding relation. 

In the category of ^-semigroups, the concept is further simplified since 
there is a canonical relational morphism between any two ^-semigroups, 
which automatically projects onto both the domain and the range. It is 
defined as follows. Let S and T be ^-semigroups with functions f : A ^ S 
and g: A T. Their product in the category SgpA (sometimes called 
their product cut-to-generators) is the subsemigroup of S x T generated by 
the ordered pairs {f{a),g{a)), a ^ A. Call this ^-subsemigroup R and let 
ip: R ^ S and ip: R ^ T he the A- morphisms derived from the projection 
maps. The canonical relational morphism between S and T is the relation 
ip(p~^- The reader should be aware that the composition of two canonical 
relational morphisms need not be canonical. 

The concept of a relational morphism is critically important in finite semi- 
group theory (see, for example, [13, 26,_33j 36, 38,l39]). Moreover, given a 
semigroup variety V and a finite A-semigroup S, it is also important to 
have detailed information about the relational morphisms from S to the 
j4-semigroups V gV, particularly when 1/ € V is itself finite. 

Remark 4.20 (Relational morphisms and expansions). The notion of a rela- 
tional morphism is very closely tied to that of an expansion and the problem 
of constructing relational morphisms can often be reduced to the problem 
of constructing finite expansions of finite ^-semigroups. Let E be an ex- 
pansion on FSgpA- Starting from an A-semigroup S and a variety V, the 
semigroup can be sent to its maximal image V ^ V (which exists since 
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V is a variety). Moreover, since is finite, V is finite. Then, 



5^ 



V 



s 



is a relational morphism. Conversely, given a systematic method for con- 
structing relational morphisms 



where S is an arbitrary finite semigroup, ^ is a finite member of V (de- 
pending on S) , and all semigroups and morphisms are A-semigroups and A- 
morphisms, the assignment 5 i— > i? is often an expansion on SgpA- See [6] 
and |13j for a more detailed discussion. 

Remark 4.21 (Constructing expansions). Four of the most common ways to 
construct expansions on FSgpA are: 

(1) Ramsey Theory; 

(2) Zimin words and uniformly recurrent sequences; 

(3) Semidirect product expansions; 

(4) Mal'cev expansions. 

The first method was applied brilliantly by Ash in [2] to solve the Type- 
II Conjecture (see [15j for a good general reference on Ramsey theory and 
see [Tl[2l[5l[18l[30] for other applications to semigroup theory). The second 
method relies heavily on classical universal algebra and semigroup theory, 
mixed with combinatorics (the article [19] contains an excellent survey of 
this approach). The third method, which includes the Karnofsky- Rhodes 
expansion, is exposited in [H] (see also [3lJ). The fourth method is the one 
used in this paper and is described in the next section; see also [14','31]. 

The last two methods of constructing expansions are encompassed by the 
notion of relatively free relational morphisms with respect to a variety of 
relational morphisms [32, Chapter 3]. 

4.4. Mal'cev expansions. In this subsection we define the Mal'cev expan- 
sion of a semigroup with respect to a variety. The results in this section are 
in general well-known but they provide a language and a context for the 
newer results contained in the later sections. 

Definition 4.22 (Mal'cev kernels). The Mal'cev kernel of a semigroup mor- 
phism ip: S* — 7- T is the collection of inverse images of idempotents. In other 
words, it is the set of subsemigroups {ip~^{e)} where e is an idempotent in 
T. If each semigroup in the Mal'cev kernel is contained in a variety V we 
say the Mal'cev kernel lies in V. 



R 



V 



s 
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The main result we need involving Mal'cev kernels is Brown's theorem. 
Recall that a variety is called locally finite if its finitely generated free objects 
remain finite (or equivalently its finitely generated members are all finite). 

Theorem 4.23 (Brown [8]). Let ip: S T be a map between two A- 
semigroups where A is finite. If T is finite and the Mal'cev kernel of ip 
lies in a locally finite variety V, then S is also finite. 

Algebraic proofs of Brown's theorem can be found in [22] and |32l Chapter 
4]; see also [S^- Mal'cev expansions were considered in [T3] (a profinite 
analogue for pseudovarieties was introduced in [31]). 

Definition 4.24 (Mal'cev expansions). Let S be an ^-semigroup and let V 
be a variety. The Mai 'cev expansion of S by V is the largest ^-semigroup 
which maps to S with Mal'cev kernel in V. More precisely, for each A- 
morphism T ^ S whose Mal'cev kernel lies in V, define ~t as the congruence 
on the free semigroup A'^ which produces T, i.e., the congruence which 
corresponds to the A-morphism A'^ T. The Mal'cev expansion of S by 
V is then defined as the semigroup where ~ is the intersection of all 

of these congruences ~t on A~^. The Mal'cev expansion of S" by V will be 
denoted 5^. See [MH] for a more detailed discussion of Mal'cev expansions. 

Some properties of Mal'cev expansions are immediate from the definition. 

Lemma 4.25. Let S be an A-semigroup and let V be a variety. The semi- 
group is the largest A-semigroup which maps to S with Mal'cev kernel 
in V. More precisely the canonical A-morphism rjs: — ?• S has Mal'cev 
kernel lying in V and if ip. T S is another such A-morphism, then there 
is a (unique) A-morphism S*^ — T so that 




T 



commutes. 

Remark 4.26 (A presentation of S^). One can alternatively construct 5^ 
as follows. Let be a basis of identities for the variety V. Define an 
A-semigroup S' with presentation consisting of all relations of the form 
u{wi, . . . ,Wn) = v{wi, . . . ,Wn) where u = f is an identity from E in re- 
variables and wi, . . . ,vun £ A~^ all map to the same idempotent of S. By 
construction, 5" —> 5 is a well-defined A-morphism with Mal'cev kernel in 
V. To verify the universal property of Lemma 14.251 notice that if T is any 
A-semigroup with T — )• 5 having Mal'cev kernel in V and u,v,wi, . . . , Wn are 
as above, then since [wi]tj • • • > [wuIt generate a semigroup in V they satisfy 
[u{vji, . . . , Wn)]T = [v{wi, . . . , Wn)]T and so there is a morphism S' ^ T of 
A-semigroups. Thus S" = 5^. 
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Notice that the functoriahty of the assignment S i— )• S"^ is clear from this 
presentation. Indeed, if / : 5" — > T is a homomorphism of j4-semigroups and 
wi, . . . ,Wn G map to the same idempotent of S, then also wi, . . . ,Wn 
map to the same idempotent of T. Thus satisfies all the defining relations 
of and so there is a morphism : 5^ — )• T^. 

One of the main goals of geometric semigroup theory is to determine the 
structure of S"^ for various locally finite varieties V. Note, however, that even 
taking a Mal'cev expansion with respect to the trivial variety is a very non- 
trivial operation. If 5 is a finite semigroup and A is the set of all elements 
in S, then Chris Ash studied, in essence, the Mal'cev expansion with 
respect to the generating set A in his solution to the Type II conjecture. 
See [HEll^ for details and [6] for an early version of this approach. 

The following theorem establishes the key properties of Mal'cev expan- 
sions, namely that they are indeed expansion in the sense of Definition 14.121 
and that when the variety V is locally finite, they preserve finiteness prop- 
erties. 

Theorem 4.27. For each variety V, the assignment S i-^ defines an 
expansion on the category SgpA- Moreover, if V is locally finite, then it 
preserves finiteness. In particular, it restricts to an expansion on FSgpA 
or FJSgpA- 

Proof. Remark 14.261 shows that S i— )• is an expansion. 

To show the second assertion we need to show that if S is finite, then 
is finite, and if S is finite ^-above, then S"^ is finite ^-above. When 
S is finite, the assertion follows immediately from Brown's theorem (The- 
orem |4]23]). When S is finite ^-above we will use Lemma 14.131 Let / be 
an ideal in S and let J be the inverse image of / under the ^-morphism 
S. Notice that the Mai' cev kernel of S^/J S/I is a, subfamily 
of the Mal'cev kernel of — )• 5 with the possible addition of the trivial 
semigroup, and thus lies in V. It now follows from Lemma 14.251 that there 
is an j4-morphism from (S*//)^ to S'^ / J and by Lemma 14.131 the proof is 
complete. □ 

A list of some locally finite varieties is given in Table [21 
The lattice of locally finite varieties is closed under taking joins, subvari- 
eties and semidirect products. The concept of a Mal'cev product can also 
be extended to form a (nonassociative) product on the set of varieties. 

Definition 4.28 (Mal'cev products of varieties). lilA and V are varieties of 
semigroups, the Mal'cev product of lA and V (denoted lA @V) is the variety 
generated by all semigroups S admitting a homomorphism ip: S ^ V with 
y G V so that the Mal'cev kernel of belongs to U. It follows from Brown's 
theorem that the Mal'cev product of two locally finite varieties is also a 
locally finite variety, and, as a result, this table can be extended by taking 
Mal'cev products. Some care must be exercised with parentheses, since 
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Notation 


Description 


Equations 


{1} 


trivial 


X = 1 


S£ 


• 1 J- J. ■ 

semiiattices 


'2 

xy = yx, X = X 




right-zero semigroups 


xy = y 


LZ 


leit-zero semigroups 


xy = X 


B 


bands 


X'^ = X 


TZB 


rectangular bands 


'2 

X = X, xzy = xwy 


CC 


left constants 


xy = xz 


TZC 


right constants 


yx = zx 


C 


2-sided constants 


xzy = xwy 


T^k 


A;-delay 


Xl - ■ - Xk = XoXl ■■■Xk 


MUCC 


null semigroups 


xy = 




nilpotent of class k 


xi • • -Xfc = 




commutative semigroups 
satisfying = x™'^" 


xy = yx, X™ = X™'"'""' 


{S) 


variety generated by a 
finite semigroup S 


Eqs(5) 




vector spaces over Zp, 
p prime 


xy = yx, xP = 1 



Table 2. Locally finite varieties 



Mal'cev products are not necessarily associative. The inclusion 

U @ {V ®W) <^ {U @V) @W 
always holds, but a strict inclusion is possible. 

Numerous relations exist between the locally finite varieties listed in the 
table with respect to the Mal'cev product. For example B = B @ B, as is 
immediate from the definition. It is a well-known consequence of Green- 
Rees structure theory that B = TZB @ SC. In contrast, if we replace B by 
B = C @ B, then finite iterated Mal'cev products of B, bracketed in the 
larger way, contain all finite aperiodic semigroups by the two-sided Prime 
Decomposition Theorem |32ll33j . 

As an illustration of the power of Lemma 14.251 we prove that the Mal'cev 
expansion with respect to the variety of rectangular bands applied to the free 
semilattice over A is the free band over A. Actually, with additional work, 
one could prove that the Birget-Rhodes expansion of the free semilattice 
is already equal to free band, but that fact is harder to derive using these 
techniques. 

Lemma 4.29. Let V and W be varieties. Then the free semigroup on A 
in V @ yV is i<>v(A)^ where F-y\;{A) is the free semigroup on A in W. In 
particular, the rectangular band expansion of the free semilattice is the free 
band. 
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Proof. Suppose that a: A — )• T is a map with T E V @ W. Then we can 
write T as a quotient n: S ^ T where S admits a morphism 9: S ^ W with 
the Mal'cev kernel of 9 in V. Let t : A S he a. map with a = ttt. Clearly 
to extend a to Fy\;{A)^ it suffices to extend r. Without loss of generality 
we may assume that 9 is onto and that t{A) generates S. Then there is an 
j4- morphism W"^ — )• S by the universal property in Lemma 14.251 But there 
is an A-morphism Fy^;(A) — ?> W since W € W, and so by functoriality we 
have an ^-morphism Fy^{A)^ — )• VF^ S, completing the proof. □ 

4.5. Rectangular bands. In this subsection we investigate the Mal'cev 
expansion with respect to the variety of rectangular bands and the closely 
related varieties of left zero/right zero semigroups. Let S be an ^-semigroup 
and consider the semigroups S^^, S^^, and 5^^. Since left and right zero 
semigroups are examples of rectangular bands, it is immediate from the 
definition that there are maps S^'^ S'~'^ and . We begin 

by establishing properties of the projection S''^'^ S. Recall that if 
is any of Green's relations, then a morphism 99: S" — )• T is a J^'-map if 
s,s' € S regular and ip{s) = (p{s') implies that s s'. See [201 Chapter 8] 
or [321 Chapter 4]. 

Lemma 4.30. If S is an A-semigroup then the map ip: IS one- 

to-one on each subgroup of . Moreover, if S is finite ^ -above, then 
the inverse image of each regular ^ -class of S contains a single regular 
^ -class in S*^^. In particular, (p is a -map. 

Proof. Let G be a subgroup of S*^^ and let e be its identity. Since (p re- 
stricted to G is a group homomorphism, it is injective on G if and only 
if k.eT(p\G = {e}. On the other hand, e and 99(e) are idempotents, so the 
inverse image 99"^ (99(e)) is a rectangular band by definition. Since the only 
idempotent in a group is its identity, we can conclude the restriction is in- 
deed one-to-one. Next, let J be a regular ^-class of S and let x and y 
be idempotents of (p~^{J). Since 99(2;) and 99(1/) are ^-equivalent (and 9? 
is onto) there exists u and v in such that ip{uxv) = 99(2/) = e, where 
e is an idempotent of J. On the other hand, 99"^ (e) is a rectangular band 
by definition, so uxv and y are ^-equivalent in S^'^ . Thus x > ^ y and 
a similar argument shows y >/ x. It follows that 9?^^ (J) contains only a 
single regular ^-class. □ 

Remark 4.31. One can similarly show that S'~'^ — )• S" is an ^'-map and 
5^^ ^ 5 is an ^'-map [32l Chapter 4]. 

Because S'^^ ^ S" is a ^'-map which is injective on subgroups, when S is 
finite the techniques and results from [32l[3l] can be applied. In particular, 
and S have the same complexity |13p32j. In the next lemma, we show 
that all three expansions are stable under iteration. 

Lemma 4.32 (Stable under iteration). A map 9: S T has Mal'cev kernel 
in CZ (TZZ) if and only if the inverse image of each left zero (right zero) 
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subsemigroup in T is a left zero (right zero) subsemigroup in S. Similarly, 

has Mai 'cev kernel in TZB if and only if the inverse image of each rectangular 

band in T is a rectangular band in S. As a consequence, S'~'^'^^ = S'~'^ , 
gnz.nz ^ gnz^ gnB.UB ^ gUB ^ 

Proof. We just handle the case of rectangular bands, as the other cases are 
similar (and this result is any event well known). Clearly, it suffices to handle 
the case of a surjective morphism 6: 5 — > T of bands with T a rectangular 
band and 9 a morphism with Mal'cev kernel in TZB. By Lemma 14.301 S 
has a single ^-class. It follows that 5 is a rectangular band. Finally, to 
see the last assertion, let E denote either CZ, TZZ, or IZB and consider the 
composition of maps S^'^ — > — > S. An idempotent in S pulls back to 
a left zero semigroup/right zero semigroup/rectangular band in 5^ which 
we now know pulls back to a semigroup of the same type in S^'^ . Thus, 
by Lemma 14.251 there exists a map — ?■ S^'^ . As usual, maps in both 
directions implies they are isomorphic. □ 

One consequence of being stable under the 7^0-expansion is that the 
semigroup remembers the first and last letters of the words representing an 
element in the following sense. 

Lemma 4.33 (Vertex labels). If S is a non-trivial finite ^ -above A-semi- 
group which is stable under the TZB-expansion and u and v are words in A~^ 
which represent the same element of S then they have the same first letter 
and the same last letter. As a consequence, there is a unique edge starting 
at the initial state of Str^{v) and for every vertex v in Cay{S,A), all of the 
edges ending at v have identical labels. 

Proof. Let S — )• {1} be the collapsing morphism. Viewing {1} as an A- 
generated semigroup via this map, functoriality yields a surjective mor- 
phism S'^^ — > {1}^^. Since {1}^^ is the free rectangular band on A by 
Lemma 14.291 and hence two words are equal in it if and only if they have 
the same first and last letter, the result follows. □ 

A similar statement and proof shows that S^^ remembers the first letter 
and S*^^ remembers the last letter. The CZ, TZZ and T^^S-expansions are 
also stable under many of the expansions defined in Definition 14. 14[ 

Theorem 4.34 (Stable under other expansions). For each A-semigroup S, 
there exist A-morphisms as shown in Figure ^ consequence, IS 
stable under Ail and NM, S^-^ is stable under Ail°P and A^, and S'^'^ is 
stable under all of the expansions shown. 

Proof. The maps that are not obvious are consequences of well-known prop- 
erties of the Rhodes, Karnofsky-Rhodes and Birget-Rhodes expansions [6l 
[T4tl31j. For instance, we verify the projection rjs'. S*^^"*" — t- S has Mal'cev 
kernel in CZ. Suppose v,w £ A~^ map to an idempotent e in S. Then since 
[vw]s = e'^ = e = [v]s, it follows that v and vw use the same transition 
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gCZ ^ gUB ^ gnz 



S 



A£l 



S 



gBR ^ ^Aif 



Figure 4.3. Relations between various expansions. 

edges from / in Cay(S', A) in the paths they read to e. Thus vw = v in S^^'^. 
It fohows that ??^^(e) is a left zero semigroup. 

By Lemma 14.321 the expansions CZ, TZZ and TZB are closed under iter- 
ation, so the stability properties of S^^ and follow by Lemma 14.181 
Similarly, is stable under all of these with the possible exception of BR. 
However, once S''^'^ is stable under AJf and A^, it is also stable under BR 
and by Lemma 14.18^ there is a map 

Combining Lemma 14.171 and Lemma 14.321 proves the following. 

Corollary 4.35. If S is a finite ^ -above A-semigroup and T is either 
gcz g-RB^ ^/^g^ Str^(u') = Cay"^ (w) is a trim, deterministic finite state 
acceptor which depends only on the element w represents, it is linear and 
it accepts the language of words equivalent to w in T. In addition, every 
element ^-above [w]t is ^-equivalent to an initial segment of w. Similar 
properties hold for the left-handed versions when T is either or . 

We thus have the following result, which shows that we can use straight- 
line automata (and in particular the factors of its base) to investigate Green's 
relations in S''^^ . This is a first small step toward converting algebraic in- 
formation into geometric form. 

Corollary 4.36 (Green's relations). Let S be an A-semigroup, let T = S'^^ 
and let u and v be words in A~^ . Then [u]t > _/ [v]t if and only if u is 
readable in Str"^(f). Moreover, [u]t '>pyg [v]t if o,nd only if u is readable in 
Str'^(f ) starting at the initial state if and only if [u]t is ^-equivalent to an 
element represented by an initial segment ofv, and [u]t >y [v]t if and only 
if u is readable in Str^(f) ending at the terminal state if and only if [u]t is 
^-equivalent to an element represented by a final segment ofv. 

Another immediate corollary of these results is that loops in straightline 
automata can be "read backwards" in the following sense. 

Corollary 4.37 (Reading backwards). Let S be an A-semigroup stable un- 
der the TZB-expansion and let w and u be words in A'^ . Ifwu and w represent 
the same element of S, then w is readable in Str^ {w) starting at I and u is 
readable as a loop starting and ending at the vertex [w\s. Moreover, there 
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is a partition of w = W1W2, with Wi G A* such that W2 is ^-equivalent to 
u. In particular, there exists words v and v' such that [v'u]s = [w2]s CL^d 
[vw2\s = [u\s o-f^d the entire configuration of paths shown in Figure \4^\ can 
he read in Str^{w) starting at I. 

Proof. The first statement is clear. By Corollary 14.361 since [u]s [w]s, 
we can write w = W1W2 with [u]s ['W2]s- The remaining statements of 
the corollary are straightforward. □ 




Figure 4.4. A standard loop before and after factoring. 

Corollary 14.371 takes on added significance once we prove that can 
be presented using only relations of the form uv = u. 

Definition 4.38 (Loop relations). A loop relation is one of the form uv = u 
for words u,v € A*. An ^-semigroup has a loop presentation if it has pre- 
sentation in which each relation is a loop relation. When focusing on the 
Jf-order instead of the M-ovdei, we consider reverse loop relations uv = v, 
which lead to reverse loop presentations. Notice that Proposition 13.21 imme- 
diately implies that the Cayley graph of a semigroup with a loop presentation 
is a loop automaton. 

Lemma 4.39. For each A-semigroup S, the semigroup can he given a 
loop presentation. In particular, can be defined by adding the relation 
uv = u whenever the words u,v £ A~^ represent the same idempotent e in 
gTlB _ j^^g analogous reverse loop presentation is another presentation of 

Proof. This is immediate from Remark 14.261 □ 

The fact that S'^^ has a loop presentation has important consequences 
for its straightline automata. In particular, they are all loop automata. 

Lemma 4.40. If S is a finite ^ -above A-semigroup with a loop presenta- 
tion, then, for all words w S A'^ , Cay^{w) is a loop automaton. 

Proof. We already observed that Cay(5, A) is a loop automaton. Clearly a 
loop presentation of Cay^{vu) can be obtained by using the loop relations 
from the presentation of Cay(5, A) and by adding the dead-end relations 
ua = d where [u]s [uj]s, a £ A and [ua]s [w]s- D 

Actually, more is true. For each specific word w only a finite number of 
the loop relations are needed in order to build its straightline automaton. 
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Lemma 4.41. If T is a finite ^ -above A-semigroup with T = , and w 
is a word in , then the automaton Str^(i(;) is a loop automaton defined 
by a finite number of loop relations. 

Proof. By Corollary 14.351 Lemma 14.391 and Lemma I4.4UI Str^(w) is a loop 
automaton, but we must prove that it requires only a finite number of loop 
relations. There is a cofinite ideal I of T so that Str'^{w) = Str^^\w) so we 
may assume without loss of generality that T is finite. 

By Ramsey theory pBJ, if a word W £ A~^ is sufficiently long relative to 
the size of T, then W can be partitioned into factors so that W = wiuvw2 
with Wi G A* and where u, v and uv all represent the same element of T 
(which is necessarily an idempotent). In the notation from [15] we need 
\W\ > [3]^ where m = \T\; call this lower bound n. Indeed, if = ai • • • a„ 
with the Oj letters, then consider the complete graph L on {0, . . . ,n}. We 
color the edges of T by elements of T. More precisely, if z < j are elements 
of {0, . . . ,n}, then the edge between i and j is colored by [a^+i • • • ajlx- 
Ramsey's theorem [15] provides a monochromatic triangle. Such a triangle 
corresponds to i < j < k with 

[fli+i • • • aj]T = [aj+i ■ ■ ■ ak]T = [oj+i • • • Ofclr 

and so taking u = Oj+i ■ ■ ■ Oj, v = aj+i ■ ■ ■ Ok gives the required partition. 

Thus by applying a loop relation of the form uv = u, with \u\, \v\ < n, 
and u and v representing the same idempotent in T, all words are equivalent 
to words of length at most n. For this finite number of short words, a finite 
number of loop relations of the form Ujfj = Uj with Ui,Vi equal to the same 
idempotent in T are needed to derive whatever further equalities are needed. 
This gives a finite list of defining loop relations for Str"^(u;). □ 

Next, we show that is almost never a monoid, but before doing so, we 
note that when S is the trivial semigroup generated by ^ = {a}, then 
is still trivial. This is because the entire semigroup must be a rectangular 
band and a one-generated rectangular band is trivial. On the other hand, 
this turns out to be the only situation where is a monoid. 

Lemma 4.42 (Almost never a monoid). If S is an A-semigroup then 
is not a monoid unless \A\ = 1 and S = {1}. 

Proof. If 1^1 > 2, then the collapsing morphism S* ^ {1} yields a surjective 
morphism S'^'^ {1}^^ (where {1} is viewed as ^-generated). Since {1}^^ 
is the free rectangular band on A, and hence not a monoid, it follows that 
is not a monoid. So we may suppose that |^| = 1, i.e., S is cyclic. 
If S is not a monoid, then clearly is also not a monoid. So we may 
assume that S" is a finite cyclic group of order n > 2 generated by a. Let 
T be the semigroup given by the presentation (a | = a^"*""). Then the 
canonical morphism T ^ S has trivial Mal'cev kernel. Thus T is a non- 
monoid homomorphic image of and so is not a monoid. □ 
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As a consequence of Lemma [4. 42 ^ our decision to use 5^ rather than only 
adding an identity when one is not already present is irrelevant once S is 
stable under the T^^S-expansion. 

4.6. Improving stabilizers. In this short subsection we first consider the 
effect of the rectangular band expansion on stabilizers, and then we show 
how an expansion due to Le Saec, Pin and Weil can further simplify their 
structure. 

Definition 4.43 (Stabilizers). Let 5 be a semigroup. The right stabilizer 
of s € 5, is the subsemigroup 5^ = {t | st = s}. Similarly, there are left 
stabilizers sS = {t \ ts = s} and double stabilizers 

s'Ss = s'S n Ss = {t \ ts = s and s't = s'}. 

The next lemma allows us to equate right stabilizers with loops accepted 
by Schiitzenberger graphs, thus turning the study of right stabilizers into an 
investigation into the properties of various automata. 

Lemma 4.44 (Loops in Sch'^(s)). Let S be an A-semigroup and let s be an 
element of S. The elements of S represented by words accepted by Sch'^(s) 
are precisely the right stabilizers of s, i.e., the elements in Ss- 

Proof. If u is a word representing s and u is a word accepted by Sch'^(s), 
then uv is accepted by Cay'^('u) so that uv and u both represent s, whence 
■s • [fj^ = [uv]s = [u\s = s, and [v]s is a right stabilizer of s. Conversely, if 
i is a right stabilizer of s and u and v represent s and t respectively, then 
[uv]s = [u]s and uv is accepted by Cay'^(ii). Moreover, since Cay'^(u) is 
deterministic, v is read as a loop based at [u]s in the strong component 
of [u]s, but this is the Schiitzenberger graph Sch'^(s) and v is reading an 
accepted loop. □ 

In a left cancellative semigroup, a semigroup S acts on the left of its 
Cayley graph by injective functions. In the next proposition, we show that 
if S is stable under the 7^Z-expansion, then the left action of 5 is injective 
when restricted to Schiitzenberger graphs. Recall the a partial order is said 
to be unambiguous if x < y,z implies y, z are comparable. 

Theorem 4.45. Let S be a finite ^ -above A-semigroup stable under the 
IZZ -expansion. Then the ^-order on S is unambiguous. Moreover, if 
r,s,t G S with r t and sr = st, then r = t. Consequently, if s,t G 5, 
then the natural map Sch^ {t) Sch'^(st) induced by left multiplication by s 
is an injective morphism of A-automata. 

Proof. It is conceptually easier to establish the dual result when S is stable 
under the iZZ-expansion, which we proceed to do. We use that S = S^^-^. 
Suppose that r, t have a common lower bound in the ^-order on S. Let 
u,v,w,z G A~^ satisfy [u\s = r, [v]s = t and [uw]s = [vz]s. We prove that 
r and t are comparable in the ^-order. By hypothesis, the paths read by 
uw and vz from / in Cay (5, A) use the same set {ei, . . . , Cm} of transition 
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edges. Since Cay(5', A) is linear, we may assume the indices are chosen so 
that ei y €2 ■ ■ ■ y Cm- Hence the set of transition edges used by u in its run 
from / is of the form {ei, . . . , e^} for some k < m and similarly the set used 
by V is of the form {ei, . . . , e^} for some r < m. Without loss of generality 
we may assume that k < r. It then follows that [u]s ><% \v\s- 

Next suppose that r,t S satisfy r ^ t and rs = ts. Then since rs = ts is 
a common lower bound to r and t in the i^-order, they are comparable by the 
previous paragraph. It follows that [u\s \v\s in 5". Choose u,v,w € 
such that [u\s = r, [v]s = t and [wjs = s. There are two cases. Suppose 
first [uw]s ^ [^]5- Then since [itifjs = Green's Lemma implies 

\u\s = [v]s, as required. Next suppose that [u]s [uwjs- As [uwjs = 
[vw]s, both uw and vw use the same transition edge e when exiting the 
Schiitzenberger graph of [u]s- Also note that if w' is an initial segment of 
w, then [nw'Js ^ [vw']s and hence 

[uw']s ^ [u]s [uw'js / {u\s [vw'js / {u\s <S=^ \vvj'\s M Ms 

as \v\s ^ ["^Is. It now follows that if w' is the longest initial segment 
of w so that [nw'Js Si [n]5, then it is also the longest initial segment so 
that [vif'Js Si \u\s and moreover \aw'\s = [v'w']s is the initial vertex of the 
transition edge e (and hence belongs to the Schiitzenberger graph of [u]s)- 
Then again Green's Lemma implies [u]s = [v]s. □ 

As a corollary, we obtain a structural result due to the second author [28] . 

Theorem 4.46 (Right stabilizers). If S is a finite ^ -above A-semigroup 
stable under the TZZ -expansion, then each right stabilizer Sg is finite 
trivial and an ^-chain in S, hut not necessarily within the subsemigroup 
Ss. 

Proof. Since each element of Sg is ^-above s, it is finite by assumption on 
S and is an ^-chain in S by unambiguity of the ^-order (Theorem I4.45P . 
Moreover, if r, t are right stabilizers of s with r S? t, then sr = s = st implies 
r = t by Theorem 14.451 Thus gS is ^-trivial, completing the proof. □ 

If S = S^^, then S has unambiguous .if- and ^-orders, a property 
already acheived by S^^ [1]. 

Corollary 4.47. If S is a finite ^ -above A-semigroup stable under the 
TZB- expansion, then the and S?-orders on S are unambiguous. 

Although it is not needed later, we record a two-sided version of Theo- 
rem |4]46] (also from [28]) which is available whenever S is stable under the 
T^jB-expansion. 

Theorem 4.48 (Double stabilizers). If S is a finite ^ -above A-semigroup 
stable under the TZB -expansion, then each double stabilizer U = gSg' is finite 
^ -trivial and an ^ -chain in S but not necessarily within U. 
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Proof. Since S is stable under TZB, it is also stable under CZ and TZZ 
(Theorem I4.34p . Furthermore, gSs' is a subsemigroup of ss'Sss' so without 
loss of generality we can take s = s' and apply Theorem 14 . 46 1 and its dual. □ 

Remark 4.49 (Fixing stabilizers). If 5 is a finite ^-semigroup stable un- 
der the 7^2-expansion and the right stabilizers are not Jf-chains within 
themselves, then one way to remedy this is by repeatedly applying the 
Henckell-Schiitzenberger expansion to S. In fact, Henckell, in his thesis, 
rediscovered the Schiitzenberger product in order to prove this result. The 
precise statement goes as follows. If S* is a finite ^-semigroup, then the 
Henckell-Schiitzenberger expansion S^'' of S with k = 2^^^ has the property 
that right stabilizers of 5^* mapped into S are ^-chains within themselves. 
This was exploited by the second and third authors to show that stabilizers 
in certain relatively free profinite semigroups are .jSf-chains within them- 
selves. See [SIITT] for details. 

Another way to improve the properties of the right stabilizers is to use 
the Mal'cev expansion with respect to the locally finite variety (Zp), first 
studied by Le Saec, Pin and Weil in [22j. See also p^. 

Theorem 4.50 (Idempotent stabilizers). Let S be a finite A-semigroup 
stable under the TZZ -expansion, let p be a prime, and let T = S^^p^. If p is 
sufficiently large relative to the size of S, then each right stabilizer Tt is an 
£?-trivial band. In other words, each subsemigroup Tt satisfies the identities 
xyx = xy and x? = x. 

Combining Theorem 14.501 with Theorem 14.461 have the following, the 
major improvements being that all of the right stabilizers are now idempo- 
tents, the stabilizers form an .if-chain within themselves, and they satisfy 
an additional equation which is useful later in the article. 

Theorem 4.51 (Improved stabilizers). If S is a finite A-semigroup, p is a 
sufficiently large prime relative to the size of S'^'^ and T = 5'^B-(^p>-^6^ ^/^g^ 
each right stabilizer subsemigroup Tt is an M-trivial (also called left regular) 
band, i.e., its satisfies the identities xyx = xy and x"^ = x. Moreover, Tt 
forms an -chain within itself so that for any two elements r,s & Tt, one is 
.^-above the other, say r s, where >y now represents the .^-order in 
Tt rather than T . In addition, whenever r and s are right t stabilizers and 
f >^ s, the equation sr = s also holds. As a consequence, the .^-classes of 
Tt are left zero semigroups. 

Proof. Since 5"^^ is stable under the TZZ expansion, the right stabilizer 
subsemigroups of are ^-trivial bands. Let T' = 5^^-<^f> and let 

t' denote the image of t G T under the natural map T ^ T' . Once it is 
noticed that Tt is contained in the inverse image of T[i under this map and 
that the Mal'cev kernel of this map lies in TZB by definition, it is clear that 
Tt consists solely of idempotents. Moreover, it is ^-trivial and an ^-chain 
in T by Theorem 14.461 Since the ^-relation restricted to idempotents is 
independent of the ambient semigroup, this completes the proof. □ 
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5. The McCammond expansion revisited 

Let us investigate what happens when we apply the McCammond expan- 
sion to straighthne automata and Cayley graphs. 

Theorem 5.1. Let {^,1) be a pointed A-automaton. Then, for w S , 

the pointed A-automaton (Str (w), /) has the unique simple path property 
and is linear. 

Proof. This follows immediately from Proposition 12.401 □ 

Example 5.2 (Expanding straightline automata). Suppose that S is an A- 
semigroup and w is a word in A~^ . If Str'^(t(;) is the automaton on the left in 
Figure ET] (with the edge labels suppressed) and w is the path which passes 

through vertices l,p,q,r, s,t, and u in that order, then Str^^**^ ^'"^^ (w) 
is the automaton on the right, where we have simplified the vertex labels 
as well as leaving out the edge labels. In particular, the labels on the right 
simply indicate the vertex in Str'^(zi;) to which it is sent under the projection 
map. Similarly, if w' is the path which passes through vertices g, r', s, t, 
and u in that order, then Str'^(w') = Str'^(i(;), but Str^^*'"^^"''))" {w') / 
Str(St^^ ("'))""'(«;). Specifically, the automaton Str^^t^''^"'))*'' (u;) would be 
altered by having the edge connecting t to u starting instead at the other 
vertex labeled t to obtain Str^^*''^^'"'))''^" (t;;'). 




Figure 5.1. Expansion of a straightline automaton. 



Definition 5.3 (The M^-expansion of a semigroup). If S is a j4-semigroup, 
then S'*^'^ will denote the transition semigroup of Cay(S', vl)^^''. This is 
an object expansion on ^-generated semigroups which preserves finiteness. 
Note that Cay{S^'' , A) is not Cay{S, A)^" . In general, we will never be 
interested in Cay{S'^'' , A) but rather in the action of 5*^'' on Cay{S,A)^ . 

Example 5.4 (Not a Cayley graph). Add example that Cay(5'^^'', ^) is not 
Cay(5,yl)*^^ 

5.1. Properties of the M^-expansion. In this section we prove that for 
each ^-semigroup S, the M^-expansion of S has Mal'cev kernel contained in 
a locally finite variety of aperiodic semigroups and so in particular preserves 
being finite ^-above (Theorem I5.12p . The key step will be to show that 
the Af^-expansion of the right zero ^-semigroup is a finite band. 
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Definition 5.5 (Right zero semigroup). For each finite set A, let A'' denote 
the right zero semigroup on A (i.e., a ■ b = b for all a,b E A) and let 
Ta^ = Cay{A^ , A). Recall that by convention Cay(5, ^) is the Cayley graph 
for rather than S itself, so that F^r has |A| + 1 vertices. If we use 1 to 
denote the root vertex of Ty and define A^ = A[J{1}, then the edges of F^r 
can be described as follows. For each a G A^ and for each b G A there is an 
edge labeled b from a to b. Viewed as a function, -b is the constant function 
from A^ to A^ sending everything to b. See Figure 15. 2[ 



1 




Figure 5.2. The automaton F^r = Cay{A^,A) when A = {a,b,c}. 

The simple paths starting at 1 in F^^r have a particularly easy description: 
a word w € A* can be read as a simple path starting at 1 if and only if no 
letter occurs more than once in w. We call these words distinct letter words. 
Thus the vertices in (F^r)*^'' are in one-to-one correspondence with the 
distinct letter words. 



1 




Figure 5.3. The automaton (F^'-)^'^'' when A = {a, 6, c}. 

Definition 5.6 (Reduction). Every word w ^ A* can be read in (F^r)^"" 
starting at 1 and ending at some vertex, say v. By construction there is a 
unique simple path in this graph from 1 to v. The word read by this simple 
path is called the reduction of w and denoted Red(w). 

For example, if j4 = {a, b, c, d, e} then Red(a6a) = a, Red(a6c) = abc and 
Red{abcdce) = abce. Here are two easy observations about reductions. 
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Lemma 5.7 (Recurrence). Suppose w € A'^ is a word with first letter a. If 
a recurs in w, that is, w = auav for possibly empty words u,v G A*, then 
Red(tt)) = Red(aw). 

Proof. Reading one letter at a time it is easy to see that the path correspond- 
ing to w only passes through vertices whose label starts with a. Moreover, 
when a recurs in w, the path w returns to the vertex labeled a. Thus the 
paths corresponding to w and to av end at the same vertex and hence have 
the same reduction. □ 

The most efficient use of Lemma 15.71 would, of course, focus on the last 
occurence of o in tt) so that a does not occur in v. 

Lemma 5.8 (No recurrence). Suppose w € A'^ is a word with first letter 
a. If a does not recur in w, then w = av for some v ^ {A \ {a})* and 
Red(ii;) = aRed{v). 

Proof. Let B denote A \ {o} and notice that there is an label-preserving 
embedding of {Tb^')^^" into {Ta"^)^^" which sends the root in the domain to 
the vertex labeled a in the range. The result is an immediate consequence 
of this observation. □ 

Example 5.9. Using Lemma 15.71 and Lemma 15.81 it is easy to calculate the 
reduction of any word. For example, \i w = abacdabdbccebgfdf then 

Ked{abacdabdbccebg f df) = Red{abdbccebg f df) Lemma [5771 

= aRed{bdbccebg f df) Lemma 15.81 

= aRed{bgfdf) Lemma [5771 

= abRed{gfdf) Lemma [5^81 

= abgRed{fdf) Lemma [5^81 

= abgRed{f) Lemma [5771 

= abgf Lemma [5.81 

We can now show that (A^)^'^'' is a band. 

Lemma 5.10 (Band). If A is a finite set, then the M'^- expansion of the 
right zero semigroup A^ is a finite band. 

Proof. Note that it suffices to check that for each distinct letter word w G A* 
and for each word u € A~^, Red{wu) = Red(ti;MM) since this shows that the 
functions -u and -uu agrees on each vertex of (F^r)^''. If w is non-empty 
and the first letter of w does not occur in u, then applying Lemma 15.81 to 
both wu and wuu reduces the problem to a similar problem with a shorter 
word w. Continuing in this way, we may assume that either w is the empty 
word or that the first letter of w occurs in u. When w is empty, a single 
application of Lemma 15.71 shows that Red('U'u) = Red(u) and we are done. 
Similarly, when w is not empty and the first letter of w, say a, occurs in 
u then applying Lemma 15.71 to the last occurence of a in wu and in wuu 
produces the same result in each case. Thus they have the same reduction, 
which completes the proof. □ 
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The following example shows that, in contrast with the Cayley graphs of 
right zero semigroups, the Mal'cev kernel of S{£/^^'') — )■ S{s^)^ does not, in 
general, consist of bands. 

Example 5.11 (Not always a band). Let denote the ^-automaton shown 
on the left-hand side of Figure 15.41 and root the graph at the vertex labeled 
1. It is easy to check that ahc and {ahcf' act the same on each vertex. In 
particular, they both send 1 and r to r and they both fail when starting at 
p or q. Thus ahc is an idempotent in S{£/). On the other hand, in si^^" ^ 
shown on the right-hand side of Figure 15.41 ahc and (a6c)^ act differently 
since 1 • ahc = r' and 1 ■ (a6c)^ = r. Thus ahc no longer represents an 
idempotent in S{s^^^''). 




Figure 5.4. An ^-automaton and its expansion . 

Despite the fact that the Mal'cev kernel does not always lie in B, it is quite 
restricted. Since we only intend to apply this expansion to ^-semigroups 
stable under the 7^;B-expansion, nothing is lost if we restrict our attention to 
automata in which the label on an edge is a function of its terminal vertex 
(Lemma I4.33P . 

Theorem 5.12 (Bounding the Mal'cev kernel). If £/ = {V,A) is a pointed 
A-automaton with root I in which the label on an edge is a function of its 
terminal vertex, then there is a finite band B acting faithfully on a set X 
such that S{^^'') divides {X,B) I (y,S{£/)) where I denotes the wreath 
product of partial transformation semigroups. 

Proof. Let V denote the vertex set of and let B be the M'^-expansion 
of the ^-semigroup . By Lemma I5.10| B is a finite band. Put X = 
Simple(y). At this point, we can define a function /: Simple(i2/) X xV 
as follows. Associate to each word u = ai - ■ ■ Onoi Simple(=e/) the pair f{u) = 
((/, /fli, . . . ,/ai • • • a„_i),/n) where elements of V* are written as strings. 
The map / is injective precisely because the edge labels are determined 
by their terminal vertices, so that the word u can be completely recovered 
from the sequence of vertices it visits. To each a € ^, we associate the 
pair a = (ga, [a]s{^)) G iX,B) I {V,S{£/)) where ga'. Q ^ B is given by 
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Q9a = '(qo,) if 90 is defined and is arbitrary otherwise, li u £ Simple(i2/) 
is as above and u ■ a is defined in si/^" , then there are two cases. If ua is 
simple, then 

f{u)a = ((/, Jai, . . . , Jai • • • a„_i), /n)a 

= ((/, /ai, . . . , /ai • • • an^iju)jua) = f{u ■ a). 

Otherwise, u ■ a = I ■ ai ■ ■ ■ aj for the unique 1 < j < n — 1 so that 
one has lua = I ■ ai ■ ■ ■ aj. But then Red(/, lai, . . . , lai ■ ■ ■ an~i,Iu) = 
(/, lai , . . . , lai ■ ■ ■ aj). Thus 

f{u)a = ((/, /ai, . . . , /ai • • • a„_i), /n)a 

= (Red(/, lai, . . . , lai ■ ■ ■ Un-i, lu), lua) 

= ((/, lai, . . . , lai ■ ■ ■ aj),Iua) = f{u ■ a). 

It now follows that S{^^^^') ^ {X, B) I {V, S{£/)) (c.f. [IS]). □ 

Recall that if V and W are varieties, then the semidirect products V * W 
is contained in the Mal'cev product {CC @V)@W where we recall that CC 
is the locally finite variety defined by the identity xy = xz [32l Chapter 2]. 
Thus by Theorem ETJl the projection S^'^" S has a Mal'cev kernel which 
lies in the locally finite aperiodic variety CC @ ^? if S" is stable under the 
T^jB-expansion. Finally, Lemma [4 . 2 5 1 and Lemma [4.331 complete the proof of 
the following corollary of Theorem 15.121 

Corollary 5.13 (Bounding S^"). If S is an A-semigroup stable under 
the TZB- expansion, then the Mal'cev kernel of S^'^" — )• S lies in the lo- 
cally finite aperiodic variety CC @ B. As a result, there is an A-morphism 

Finally, we note that even though the inverse images of idempotents is 
not always a band (Example 15. lip , the torsion in the Mal'cev kernel is quite 
controlled. 

Lemma 5.14 (Bounding torsion). If S is an A-semigroup stable under the 
TZB -expansion, then the semigroups in the Mal'cev kernel of S^^" S satisfy 
the identity = x^. As a result, if S satisfies the identity = x"^~^^ for 
constants m and n, then S^" satisfies the identity x™'*'"'^ = x^™^-^)"'""' for the 
same constants m and n. 

Proof. It suffices to observe that the variety CC @ B clearly satisfies the 
identity x"^ = x^ since in any band x and x^ map to the same idempotent 
and so by definition of CC we have xx = xx'^. □ 

6. Algebraic rank function 

In this section we begin by defining an algebraic rank function. For Kleene 
expressions the rank is the nested star-height, for loop automata it is the 
maximum number of loops within loops. In the case of the Burnside semi- 
groups, B{m,n) for m > 6 and n > 1, the first author proved that the 
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automata, Kleene and algebraic definition all agree for Str(t(;) in [24J. He 
also proved that these semigroups are finite ^-above. One main idea of 
geometric semigroup theory is to run this procedure backwards. That is, 
to start with a finite ^ -above semigroup, define the rank algebraically and 
then prove it is the automata rank in many cases. This also has strong ties 
with the holonomy theorem for semigroups |29j . 

Definition 6.1 (Algebraic rank function). Let 5" be a finite ^-above A- 
semigroup and let s be one of its element. Since S is torsion, there is 
a unique idempotent in the cyclic subsemigroup generated by s which we 
denote s^. Moreover, since S is finite ^-above, there are only a finite 
number of idempotents in S which are ^-above s"^. We define the algebraic 
rank of s to be the length of the longest strictly increasing ^-chain of 
idempotents starting at s"^. This defines a map Rank^ : 5 — )• {0, 1, 2, • • • } = 
N. More specifically, 

Rank5(s) = max{r \ = cq < jr ei < jf ■ ■ ■ < jr er} 

where all Cj are idempotents in S. 

When defining a geometric rank function on Cay{S, A)'^^" , we will always 
assume that the ordering on the loops at a vertex refines the algebraic rank 
of the images in S of the words labeling the loops. 

The following lemma records some elementary properties of the algebraic 
rank function. 

Lemma 6.2 (Elementary properties). Let S be a finite ^ -above A-semi- 
group and let Rank^: 5" — )• N denote its algebraic rank function. If s and t 
are elements of S and e and f are idempotents in S, then: 

• Rank5(s) = Ranks'(s^) for all k > 1; 

• Rank5(e) > Rank5'(/) whenever e is ^ -above f; 

• Rank5(e) = Rank5'(/) wherever e and f are ^ -equivalent; 

• Rank5(e) = Rank5'(/) and e> j; f implies e= j f ; 

• Rank5(st) = Rank5(ts) for all s,t ^ S. 

Proof. The only part which is not immediate is the last one, but this follows 
from the first statement once it is noticed that (st)'^ is always ^-equivalent 
to (ts)'^. □ 

The next lemma studies the effect of ^'-maps on the rank. 

Lemma 6.3 (Rank and ^'-maps). If S —?■ T is an onto ^'-map between 
finite semigroups, then Rank5(s) = RankT('/7(s)) for all s £ S. 

Proof. By assumption the map if has the property that ip^^ of a regular 
class consists of one regular ^-class plus maybe some null elements above 
it. Hence the restriction of the ^-order to the regular ^-classes results in 
the same poset for both S and T. The fact that Rank^ = Rankj- now follows 
immediately. □ 
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On the other hand, the rank function can fah an arbitrarily large amount 
under other types of maximal proper surmorphisms |32fl34j that are not 
^'-maps. 

Remark 6.4. For example, consider the semilattice S = {0, 1,-- - ,n} with 
max. Then Ranks{i) = i. Now put S' = S L) {n'} where n'j = jn' = n for 
< J < n and n'n' = n'; so S' is obtained from S by inflating n to {n, n'}. 
Then Rank5'/(n') = 0, but its image under the natural map S' ^ S that 
identifies n with n' has rank n. This map is a maximal proper surmorphism 
of class IIIr^r in the terminology of [34j and the Mal'cev kernel of the 
surmorphism is in SC. 

Lemma 6.5 (Lifting Lemma). Let S he finite ^ -above and let ip: S —?■ T 
be an onto map. For each t £ T there exists an s G S such that (p{s^) = t^ 
and Rank5(s) > Rankj'(f). 

Proof. Suppose = cq < j ei < j ■ ■ ■ < j Cr, where < j; denotes the 
^-order in T. First choose an idempotent fr in 5 which maps to e^, then 
in SfrS choose an idempotent fr-i which maps to Cr-i, etc. This yields 
idempotents /o < ■ ■ ■ < /r in 5 with ip{fi) = Cj. Setting s = s^ = 
establishes the result. □ 

Recall that idempotents in a semigroup are partially ordered by putting 
e < / if e/ = e = /e, or equivalently, e <,%' f. 

Lemma 6.6 (Alternate definition). Let S be a finite ^ -above A-semigroup. 
The function / : 5 — > N defined by 

f{t) = max{r \ t"^ ^ cq < ei < ■ ■ ■ < e^}, 

where each ei is an idempotent, is the same as the rank function Rank5'(t). 

Proof. By an elementary lemma from [27J, if J and J' are regular ^-classes 
in a finite semigroup with J > J' , then for each idempotent e G J, there 
exists an idempotent / € J' so that e > /. Indeed, if e' = uev G J' is 
an idempotent set / = eve'ue. Then = evuevueevuevue = eve'e'e'ue = 
eve'ue = f and ufv = ueve'uev = e'e'e' = e'; so / G J' is an idempotent 
with / < e. This plus the fact that ^-equivalent idempotents have the 
same rank proves the result. □ 

The following corollary is immediate. 

Corollary 6.7 (Subsemigroups). If S is finite ^ -above and T is a sub- 
semigroup of S, then T is finite ^ -above and, for all t (zT, the inequality 
RankT(t) < Rank5(t) holds. 

The importance of the algebraic rank function is as follows. Suppose 
that S is an A-semigroup and let r/: Cay{S, A)'^^" — > Cay{S,A) be the cor- 
responding simple directed cover. Let w be a vertex of Cay{S, A)^^" . Then 
[wjs stabilizes ?y(v). Often S will already have been expanded to have idem- 
potent stabilizers. Thus the loops at v, when mapped into S, will be an 
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^-chain of idempotents and we want to choose a geometric rank function 
accordingly. More formally, we shall always choose a geometric rank func- 
tion r for Cay(S', y4)^^° so that the following occurs. If e, / are bold arrows 
ending at v and y,z are the labels of lp(e), lp(/), respectively, then we shall 
assume that if Rank5(y) < Ranks{z) then r(e) < r(/). In other words, the 
order on bold arrows at v given by the geometric rank function will be a 
topological sorting of the algebraic rank of the labels of the loops (taken in 
S). 
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